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The present invention provides improved adenovirus vectors and packaging cell lines. One type of improved adenoviral vector 
comprises deletions within the E2b region of the adenoviral genome. These E2b-de]eted virus are used in conjunction with novel cell 
lines that constitutivcly express E2b gene products. The present invention further provides adenoviral vectors deleted for all viral coding 
regions. These "gutted" vectors permit the transfer of large genes to cells as demonstrated herein by the transfer of the dystrophin gene to 
the muscle of mice. The E2b-deleted vectors and the gutted vectors provide improved adenoviral vectors useful for a wide variety of gene 
therapy applications. 
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IMPROVED ADENOVIRUS VECTORS 

FIELD OF THE INVENTION 

The invention relates to improved adenovirus vectors, and more specifically, 
5 adenovirus vectors useful for gene therapy. 

BACKGROUND 

Adenoviruses (Ad) are double-stranded DNA viruses. The genome of adenoviruses 
(--36 kb) is complex and contains over 50 open reading frames (ORFs). These OElFs are 

10 overlapping and genes encoding one protein are often embedded within genes coding for other 
Ad proteins. Expression of Ad genes is divided into an early and a late phase. Early genes 
are those transcribed prior to replication of the genome while late genes are transcribed after 
replication. The early genes comprise Ela, Elb, E2a, E2b, E3 and E4. The E la gene 
products are involved in transcriptional regulation; the El b gene products are involved in the 

15 shut-off of host cell functions and mRNA transport. E2a encodes the a DNA-binding protein 
(DBP): E2b encodes the viral DNA polymerase and preterminal protein (pTP). The E3 gene 
products are not essential for viral growth in cell culture. The E4 region encodes regulatory 
protein involved in transcriptional and post-transcriptional regulation of viral gene expression: 
a subset of the E4 proteins are essential for viral growth. The products of the late genes (e.g., 

20 LI -5) are predominantly components of the virion as well as proteins involved in the 
assembly of virions. The VA genes produce VA RNAs which block the host cell from 
shutting down viral protein synthesis. 

Adenoviruses or Ad vectors have been exploited for the delivery of foreign genes to 
cells for a number of reasons including the fact that Ad vectors have been shown to be highly 

25 effective for the transfer of genes into a wide variety of tissues in vivo and the fact that Ad 
infects both dividing and non-dividing cells; a number of tissues which are targets for gene 
therapy comprise largely non-dividing cells. 

The current generation of Ad vectors suffer from a number of limitations which 
preclude their widespread clinical use including: 1) immune detection and elimination of cells 

30 infected with Ad vectors. 2) a limited carrying capacity (about 8,5 kb) for the insertion of 
foreign genes and regulatory elements, and 3) low-level expression of Ad genfes in cells 
infected with recombinant Ad vectors (generally, the expression of Ad proteins is toxic to 
cells). 
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The latter problem was thought to be solved by using vectors containing deletions m 
the El region of the Ad genome (El gene products are required for viral gene expression and 
replication). However, even with such vectors, low-level expression of Ad genes is observed. 
U is now thought that most mammalian cells contain El -like factors which can substitute for 
5 the missing Ad El proteins and permit expression of Ad genes remaining on the El deleted 
vectors. 

What is needed is an approach that overcomes the problem of low level expression of 
Ad eenes. Such an approach needs to ensure that adenovirus vectors are safe and non- 
immunogenic. 

SUMMARY OF THE INVENTION 

The present invention contemplates tNvo approaches to improving adenovirus vectors. 
The first approach generally contemplates a recombinant plasmid. together with a helper 
adenovirus, in a packaging cell line. The helper adenovirus is rendered safe by utilization of 
1 5 loxP sequences. In the second approach, "damaged" adenoviruses are employed. While the 
"damaged" adenovirus is capable of self-propagation in a packaging cell line, it is not capable 
of expressing certain genes (e.g., the DNA polymerase gene and/or the adenovirus preterminal 
protein gene). 

In one embodiment of the first approach, the presem invention contemplates a 
20 recombinant plasmid. comprising in operable combination: a) a plasmid backbone. 

comprising an origin of replication, an antibiotic resistance gene and a eukaryotic promoter 
element: b) the left and right inverted terminal repeats (ITRs) of adenovirus, said ITRs each 
having a 5- and a 3- end and arranged in a tail to tail orientation on said plasmid backbone; 
c) the adenovirus packaging sequence, said packaging sequence having a S" and a 3' end and 
25 linked to one of said ITRs; and d) a first gene of interest operably linked to said promoter 
element. 

While it is not intended that the present invention be limited by the precise size of the 
plasmid. it is generally desirable that the recombinant plasmid have a total size of between 27 
and 40 kilobase pairs. It is preferred that the total size of the DNA packaged into an EAM 
30 derived from these recombinant plasmids is about the length of the wild-type adenovirus 

uenome (-36 kb). It is well known in the art that DNA representing about 105% of the wild- 
wpe length may be packaged into a viral particle; thus the EAM derived firom recombinant 
plasmid mav contain DNA whose length exceeds by -105% the size of the wild-type genome. 
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The size of the recombinant plasmid may be adjusted using reporter genes and genes of 
interest having various sizes (including the use of different sizes of introns within these genes) 
as well as through the use of irrelevant or non-coding DNA fragment which act as "stuffer" 
fragments {e.g.. portions of bacteriophage genomes). 

in one embodiment of the recombinant plasmid, said 5' end of said packaging 
sequence is linked to said 3' end of said left ITR. In this embodiment, said first gene of 
interest is linked to said 3' end of said packaging sequence. It is not intended that the present 
invention be limited by the nature of the gene of interest; a variety of genes (including both 
cDNA and genomic forms) are contemplated; any gene having therapeutic value may be 
inserted into the recombinant plasmids of the present invention. For example, the transfer of 
the adenosine deaminase (ADA) gene is useful for the treatment of ADA- patients: the 
transfer of the CFTR gene is useful for the treatment of cystitic fibrosis. A wide variety of 
diseases are known to be due to a defect in a single gene. The plasmids. vectors and EAMs 
of the present invention are useful for the transfer of a non-mutated form of a gene which is 
15 mutated in a patient thereby resulting in disease. The present invention is illustrated using 
recombinant plasmids capable of generating encapsidated adenovirus minichromosomes 
(EAMs) containing the dystrophin cDNA gene (the cDNA form of this gene is preferred due 
to the large size of this gene): the dystrophin gene is non-functional in muscular dystrophy 
(MD) patients. However, the present invention is not limited toward the use of the dystrophin 
20 gene for treatment of MD: the use of the utrophin (also called the dystrophin related protein) 
gene is also contemplated for gene therapy for the treatment of MD [Tinsley et ai (1993) 
Curr. Opin. Genet. Dev. 3:484 and (1992) Nature 360:591]; the utrophin gene protein has 
been reported to be capable of functionally substituting for the dystrophin gene [Tinsley and 
Davies (1993) Neuromusc. Disord. 3:539]. As the utrophin gene product is expressed in the 
25 muscle of muscular dystrophy patients, no immune response would be directed against the 
utrophin gene product expressed in cells of a host (including a human) containing the 
recombinant plasmids. Ad vectors or EAMs of the present invention. While the present 
invention is illustrated using plasmids containing the dystrophin gene, the plasmids. Ad 
vectors and EAMs of the present invention have broad application for the transfer of any gene 
30 whose gene product is missing or altered in activity in cells. 

Embodiments are contemplated wherein the recombinant plasmid further comprises a 
second gene of interest. In one embodiment, said second gene of interest is linked to said 3' 
end of said right ITR. In one embodiment, said second gene of interest is a reporter gene. A 
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variet>' of reporter genes are contemplated, including but not limited to £ coli p-galactosidase 
gene, the human placental alkaline phosphatase gene, the green fluorescent protein gene and 
the chloramphenicol acetyltransferase gene. 

As mentioned above, the first approach also involves the use of a helper adenovirus in 
combination with the recombinant plasmid. In one embodimem. the present invention 
contemplates a helper adenovirus comprising i) first and a second loxP sequences, and ii) the 
adenovirus packaging sequence, said packaging sequence having a 5' and a r end. It is 
preferred that said first loxP sequence is linked to the S" end of said packaging sequence and 
said second loxP sequence is linked to said 3' end of said packaging sequence. In one 
embodiment, the helper virus comprises at least one adenovirus gene coding region. 

The present invention contemplates a mammalian cell line containing the above- 
described recombinant plasmid and the above-described helper virus. It is preferred that said 
cell line is a 293-derived cell line. Specifically, in one embodimem. the present invention 
contemplates a mammalian cell line, comprising: a) a recombinant plasmid. comprising, in 
operable combination: i) a plasmid backbone, comprising an origin of replication, an 
antibiotic resistance gene and a eukaryotic promoter element, ii) the left and right inverted 
terminal repeats (ITRs) of adenovirus, said ITRs each having a and a 3' end and arranged 
in a tail to tail orientation on said plasmid backbone, iii) the adenovirus packaging sequence, 
said packaging sequence having a 5' and a 3' end and linked to one of said ITRs. and iv) a 
first gene of interest operably linked to said promoter element: and b) a helper adenovirus 
comprising i) first and a second loxP sequences, and ii) the adenovirus packaging sequence, 
said packaging sequence having a 5* and a 3' end. As noted previously, said helper can 
further comprise at least one adenovirus gene coding region. 

Overall, the first approach allows for a method of producing an adenovirus 
minichromosome. In one embodiment, this method comprises: A) providing a mammalian 
cell line containing: a) a recombinant plasmid. comprising, in operable combination, i) a 
plasmid backbone, comprising an origin of replication, an antibiotic resistance gene and a 
eukar\'Otic promoter element, ii) the left and right inverted terminal repeats (ITRs) of 
adenovirus, said ITRs each having a 5' and a 3' end and arranged in a tail to tail orientation 
on said plasmid backbone, iii) the adenovirus packaging sequence, said packaging sequence 
havina a 5' and a 3' end and linked to one of the ITRs. and iv) a first gene of interest 
operably linked to said promoter elemem; and b) a helper adenovirus comprising i) first and i 
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second loxP sequences, ii) at least one adenovirus gene coding region, and iii) the adenovirus 
packaging sequence, said packaging sequence having a 5" and a 3' end; and 
B) growing said cell line under conditions such that said adenovirus gene coding region is 
expressed and said recombinant plasmid directs the production of at least one adenoviral 
5 minichromosome. It is desired that said adenovirus miniclvomosome is encapsidated. 

In one embodiment, the present invention contemplates recovering said encapsidated 
adenovirus minichromosome and. in turn, purifying said recovered encapsidated adenovirus 
minichromosome. Thereafter, said purified encapsidated adenovirus minichromosome can be 
administered to a host {e.g., a mammal). Human therapy is thereby contemplated. 

10 It is not intended that the present invention be limited by the nature of the 

administration of said minichromosomes. All types of administration are contemplated, 
including direct injection (intra-muscular, intravenous, subcutaneous, etc.). inhalation, etc. 

As noted above, the present invention contemplates a second approaches to improving 
adenovirus vectors. In the second approach, "damaged" adenoviruses are employed. In one 

1 5 embodiment, the present invention contemplates a recombinant adenovirus comprising the 
adenovirus E2b region having a deletion, said adenovirus capable of self-propagation in a 
packaging cell line and said E2b region comprising the DNA polymerase gene and the 
adenovirus preterminal protein gene. In this embodiment, said deletion can be within the 
adenovirus DNA polymerase gene. Alternatively, said deletion is within the adenovirus 

20 preterminal protein gene. Finally, the present invention also contemplates embodiments 
wherein said deletion is within the adenovirus DNA polymerase and preterminal protein 
genes. 

The present invention further provides cell lines capable of supporting the propagation 
of Ad virus containing deletions within the E2b region. In one embodiment the invention 

25 provides a mammalian cell line stably and constitutively expressing the adenovirus El gene 
products and the adenovirus DNA polymerase. In one embodiment, these cell lines comprise 
a recombinant adenovirus comprising a deletion within the E2b region, this E2b-deleted 
recombinant adenovirus being capable of self-propagation in the cell line. The present 
invention is not limited by the nature of the deletion within the E2b region. In one 

30 embodiment, the deletion is within the adenoviral DNA polymerase gene. 

The present invention provides cells lines stably expressing E 1 proteins and the 
adenoviral DNA polymerase, wherein the genome of the cell line contains a nucleotide 
sequence encoding adenovirus DNA polymerase operably linked to a heterologous promoter. 
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In a particularly preferred embodiment, the cell line is selected from the group consistmg ot 
the B-6. B-9. C-l. C-4. C-7. C-13. and C-14 cell lines. 

The present invention further provides cell lines which further constitutively express 
the adenovirus preterminal protein (pTP) gene product (in addition to El proteins and DNA 
polvmerase). In one embodiment, these pTP-expressing cell lines comprise a recombinant 
adenovirus comprising a deletion within the E2b region, the recombinant adenovirus being 
capable of self-propagation in the pTP-expressing cell line. In a preferred embodiment, the 
deletion within the E2b region comprises a deletion within the adenoviral preterminal protein 
gene. In another preferred embodiment, the deletion within the E2b region comprises a 
deletion within the adenoviral (Ad) DNA polymerase and preterminal protein genes. 

In a preferred embodiment, the cell lines coexpressing pTP and Ad DNA polymerase, 
contain within their genome, a nucleotide sequence encoding adenovirus preterminal protein 
operablv linked to a heterologous promoter. In the invention is not limited by the nature of 
the heterologous promoter chosen. The art knows well how to select a suitable heterologous 
promoter to achieve expression in the desired host cell (e.g., 293 cells or derivative thereof). 
In a particularly preferred embodiment, the pTP- and Ad polymerase-expressing cell line is 
selected from the group consisting of the C-l, C-4. C-7, CIS, and C-14 cell lines. 

The present invention provides a method of producing infectious recombinant 
adenovirus particles containing an adenoviral genome containing a deletion within the E2b 
region, comprising: a) providing: i) a mammalian cell line stably and constitutively 
expressing the adenovirus El gene products and the adenovirus DNA polymerase: ii) a 
recombinant adenovirus comprising a deletion within the E2b region, the recombinant 
adenovirus being capable of self-propagation in said cell line; b) introducing the recombinant 
adenovirus into the cell line under conditions such that the recombinant adenovirus is 
propauated to form infectious recombinant adenovirus particles; and c) recovering the 
infectLus recombinant adenovirus particles. In a preferred embodiment, the method further 
comprises d) purifying the recovered infectious recombinant adenovirus particles. In yet 
another prefened embodiment, the method further comprises e) administering the purified 
recombinant adenovirus particles to a host which is preferably a mammal and most preferably 
) a human. 

In another preferred embodiment the mammalian cell line employed in the abdve 
method further constitutively expresses the adenovirus preterminal protein. 
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The present invention further provides a recombinant plasmid capable of replicating in 
a bacterial host comprising adenoviral E2b sequences, the E2b sequences containing a deletion 
within the polymerase gene, the deletion resulting in reduced polymerase activity. The 
present invention is not limited by the specific deletion employed to reduce polymerase 
activity. In a preferred embodiment, the deletion comprises a deletion of nucleotides 8772 to 
9385 in SEQ ID N0:4. In one preferred embodiment, the recombinant plasmid has the 
designation pApol. In another preferred embodiment, the recombinant plasmid has the 
designation pBHGllApol. 

The present invention also provides a recombinant plasmid capable of replicating in a 
bacterial host comprising adenoviral E2b sequences, the E2b sequences containing a deletion 
within the preterminal protein gene, the deletion resulting in the inability to express functional 
preterminal protein without disruption of the VA RNA genes. The present invention is not 
limited b>' the specific deletion employed to render the pTP inactive: any deletion within the 
pTP coding region which does not disrupt the ability to express the Ad VA RNA genes may 
1 5 be employed. In a preferred embodiment, the deletion comprises a deletion of nucleotides 
10.705 to 1 1.134 in SEQ ID N0:4. In one preferred embodiment- the recombinant plasmid 
has the designation pApTP. In another preferred embodiment, the recombinant plasmid has 
the designation pBHGllApTP. 

In a preferred embodiment, the recombinant plasmid containing a deletion with the 
20 pTP region further comprises a deletion within the polymerase gene, this deletion resulting in 
reduced (preferably absent) polymerase activity. The present invention is not limited by the 
specific deletion employed to inactivate the polymerase and pTP genes. In a preferred 
embodiment, the deletion comprises a deletion of nucleotides 8,773 to 9586 and 11,067 to 
12,513 in SEQ ID N0:4. In one preferred embodiment, the recombinant plasmid has the 
25 designation pAXBApolApTPVARNA+tl3, In another preferred embodiment, the recombinant 
plasmid has the designation pBHGl lApolApTPVARNA+tl3. 

BRIEF DESCRIPTION OF THE FIGURES 

Figure 1 A is a schematic representation of the Ad polymerase expression plasmid 
30 pRSV-pol indicating that the Ad2 DNA polymerase sequences are under the transcriptional 
control of the RSV-LTR/promoter element and are flanked on the 3' end by the SV-40 small 
t intron and SV-40 polyadenylation addition site. 
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Figure IB is a schematic representation of the expression piasmid pRSV-pTP 
indicating that the Ad2 preterminal protein sequences are under the transcriptional control of 
the RSV-LTR/promoter element and are flanked on the end by the SV-40 small-t intron 
and SV-40 polyadenylation signals. 
5 Figure 2 is an ethidium bromide-stained gel depicting the presence of Ad pol DNA 

sequences in genomic DNA from LP-293 cells and several hygromycin-resistant cell lines. 
The -750 bp PGR products are indicated by the arrow. 

Figure 3 is an autoradiograph depicting the results of a viral replication- 
complementation assay analyzing the functional activity of the Ad polymerase protein 
10 expressed by LP-293 cells and several hygromycin resistant cell lines. The 8.010 bp HindUl 
fragments analyzed by densitometry are indicated by an arrow. 

Figure 4 is an autoradiograph indicating that cell lines B-6 and C-7 contained a 
smaller and a larger species of Ad polymerase mRNA while LP-293 derived RNA had no 
detectable hybridization signal. The location of the two species of Ad polymerase mRNA are 
15 indicated relative to the 28S and 18S ribosomal RNAs. and the abenant transcript expressed 
by the B-9 cell line is indicated by an arrow. 

Figure 5 is an autoradiograph showing which of the cell lines that received the 
preterminal protein expression plasmid indicated the presence of pRSV-pTP sequences (arrow 
labelled "pTP") and El sequences (arrow labelled "El"). 
20 Figure 6 is an autoradiograph indicating in which of the cell lines transcription of 

preterminal protein is occurring (arrow labelled "~ 3kb"). 

Figure 7 is an autoradiograph indicating that the expression of the Ad-polymerase 
could overcome the replication defect of H5i«M00 at non-permissive temperatures. 

Figure 8 graphically depicts plaque titre for LP-293, B-6 and CI cell lines infected 
25 with wlAdS. H5ts36. or HSamMOO, and the results demonstrate that the C-7 cell line can be 
used as a packaging cell line to allow the high level growth of El. preterminal, and 

polymerase deleted Ad vectors. 

Figure 9 is an autoradiograph showing that the recombinant pol" virus is viable on 
pol-expressing 293 cells but not on 293 cells which demonstrates that recombinant Ad viruses 
30 containinsi the 612 bp deletion found within pApol lack the ability to express Ad polymerase. 

Figure 10 is a schematic representation of the structure of pAdSpdys wherein the two 
inverted adenovirus origins of replication are represented by a left and right invened terminal 
repeat (LITR/RITR). PI and P2 represent location of piobes used for Southern blot analysis. 
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Figure 1 1 graphically illustrates the total number of transducing adenovirus particles 
produced (output) per serial passage on 293 cells, total input virus of either the helper (hpAP) 
or AdSpdys. and the total number of cells used in each infection. 

Figures 12A-B show Southern blot analyses of viral DNA from lysates 3. 6. 9 and 12, 
5 digested with the restriction enzymes BssHlL Nru\ and EcoRW. For the analyses, fragments 
from the C terminus of mus musculus dystrophin cDNA (A) or the N terminus of E coli 
galactosidase (B) were labeled with dCTP^' and used as probes. 

Figures 13A-B show the physical separation of AdSpdys from hpAP virions at the 
third (A) and final (B) stages of CsCl purification. 
10 Figure 14 graphically depicts the level of contamination of AdSpdys EAMs by hpAP 

virions obtained from tlie final stage of CsCl purification as measured by P-galactosidase and 
alkaline phosphatase expression. The ratio of the two types of virions - AdSpdys EAMs 
(LacZ) or hpAP (AP) in each fraction is indicated in the lower graph. 

Figures 15A-B are western blot (immunoblot) analyses of protein extracts from mdx 
15 myoblasts and myotubes demonstrating the expression of P-galactosidase (A) and dystrophin 
(B) in cells infected with AdSpdys EAMs. 

Figures 16A-C depict immunofluorescence of dystrophin expression in wild type 
MM 14 myotubes (A), uninfected mdx (B) and infected mdx myotubes (C). 

Figure 17 is a schematic representation of the MCK/lacZ constructs tested to determine 
20 what portion of the - 3.3 kb DNA fragment containing the enhancer/promoter of the MCK 
gene is capable of directing high levels of expression of linked genes in muscle cells. 

Figure 1 8 is a schematic representation of a GFP/p-gal reporter construct suitable for 
assaying the expression of Cre recombinase in mammalian cells. 

Figure 1 9 is a schematic representation of the recombination event between the loxP 
25 shuttle \ ector and the Ad5t//7001 genome. 

DEFINITIONS 

To facilitate understanding of the invention, a number of terms are defined below. 

The term "gene" refers to a DNA sequence that comprises control and coding 
30 sequences necessary for the production of a polypeptide or precursor thereof. The polypeptide 
can be encoded by a full length coding sequence or by any portion of the coding sequence so 
long as the desired enzymatic activity is retained. The term "gene" encompasses both cDNA 
and genomic forms of a given gene. 
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The term "wild-type" refers to a gene or gene product which has the characteristics of 
that gene or gene product when isolated from a naturally occurring source. A wild-type gene 
is that which is most frequently observed in a population and is thus arbitrarily designated the 
"normal" or "wild-type" form of the gene. In contrast, the term "modified" or "mutant" refers 
to a gene or gene product which displays modifications in sequence and or functional 
properties {i.e.. altered characteristics) when compared to the wild-type gene or gene product. 
It is noted that naturally-occurring mutants can be isolated; these are identified by the fact that 
they have altered characteristics when compared to the wild-type gene or gene product. 

The term "oligonucleotide" as used herein is defined as a molecule comprised of two 
or more deoxyribonucleotides or ribonucleotides, usually more than three (3). and typically 
more than ten (10) and up to one hundred (100) or more (although preferably between twenty 
and thirty). The exact size will depend on many factors, which in turn depends on the 
ultimate function or use of the oligonucleotide. The oligonucleotide may be generated in any 
manner, including chemical synthesis. DNA replication, reverse transcription, or a 

5 combination thereof. 

As used herein, the term "regulatory elemem" refers to a genetic elemem which 
controls some aspect of the expression of nucleic acid sequences. For example, a promoter is 
a regulatory elemem which facilitates the initiation of transcription of an operably linked 
coding region. Other regulatory elements are splicing signals, polyadenylation signals. 

_0 termination signals, etc. (defined infra). 

Transcriptional control signals in eucaryotes comprise "promoter" and "enhancer" 
elements. Promoters and enhancers consist of short arrays of DNA sequences that interact 
specifically with cellular proteins involved in transcription [Maniatis. T. et ai. Science 
236:1237 (1987)]. Promoter and enhancer elements have been isolated from a variety of 
25 eukaryotic sources including genes in yeast, insect and mammalian cells and viruses 

(analogous control elements, i.e.. promoters, are also found in procaryotes). The selection of 
a particular promoter and enhancer depends on what cell type is to be used to express the 
protein of interest. Some eukaryotic promoters and enhancers have a broad host range while 
others are functional in a limited subset of cell types [for review see Voss. S.D. et al. Trends 
30 Biochem. Sci.. 1 1 :287 ( 1 986) and Maniatis. T. et ai . supra ( 1 987)]. 

The term "recombinant DNA vector" as used herein refers to DNA sequences 
• containing a desired coding sequence and appropriate DNA sequences necessary for the 
expression of the operably linked coding sequence in a particular host organism (e.g. 
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mammal). DNA sequences necessary for expression in procaryotes include a promoter, 
optionallx an operator sequence, a ribosome binding site and possibly other sequences. 
Eukaryoiic cells are known to utilize promoters, polyadenlyation signals and enhancers. 

The terms "in operable combination", "in operable order" and "operably linked" as 
used herein refer to the linkage of nucleic acid sequences in such a manner that a nucleic acid 
molecule capable of directing the transcription of a given gene and/or the synthesis of a 
desired protein molecule is produced. The term also refers to the linkage of amino acid 
sequences in such a manner so that a functional protein is produced. 

The term "genetic cassette" as used herein refers to a fragment or segment of DNA 
containing a particular grouping of genetic elements. The cassette can be removed and 
inserted into a vector or piasmid as a single unit. A plasmid backbone refers to a piece of 
DNA containing at least plasmid origin of replication and a selectable marker gene (e.g., an 
antibiotic resistance gene) which allows for selection of bacterial hosts containing the plasmid: 
the plasmid backbone may also include a polylinker region to facilitate the insertion of 
genetic elements within the plasmid. When a particular plasmid is modified to contain non- 
plasmid elements {e.g.. insertion of Ad sequences and/or a eukaryotic gene of interest linked 
to a promoter element), the plasmid sequences are referred to as the plasmid backbone. 

Because mononucleotides are reacted to make oligonucleotides in a manner such that 
the 5' phosphate of one mononucleotide pentose ring is attached to the 3' oxygen of its 
neighbor in one direction via a phosphodiester linkage, an end of an oligonucleotide is 
referred to as the "5' end" if its 5' phosphate is not linked to the 3" oxygen of a 
mononucleotide pentose ring and as the "3' end" if its 3' oxygen is not linked to a 5' 
phospliate of a subsequent mononucleotide pentose ring. As used herein, a nucleic acid 
sequence, even if internal to a larger oligonucleotide, also may be said to have 5* and 3' ends. 

When two different, non-overlapping oligonucleotides anneal to different regions of 
the same linear complementary nucleic acid sequence, and the 3' end of one oligonucleotide 
points towards the 5' end of the other, the former may be called the "upstream" 
oligonucleotide and the latter the "downstream" oligonucleotide. 

The term "primer" refers to an oligonucleotide which is capable of acting as a point of 
initiation of synthesis when placed under conditions in which primer extension is initiated. 
An oligonucleotide "primer" may occur naturally, as in a purified restriction digest or may be 
produced synthetically. 
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A primer is selected to be "substantially" complementary to a strand of specific 
sequence of the template. A primer must be sufficiently complementary to hybridize with a 
template strand for primer elongation to occur. A primer sequence need not reflect the exact 
sequence of the template. For example, a non-complementary nucleotide fragmem may be 
attached to the end of the primer, with the remainder of the primer sequence being 
substantially complementary to the strand. Non-complementar>' bases or longer sequences can 
be interspersed into the primer, provided that the primer sequence has sufficient 
complementarity with the sequence of the template to hybridize and thereby form a template 
primer complex for synthesis of the extension product of the primer. 

"Hybridization" methods involve the amiealing of a complementary sequence to the 
target nucleic acid (the sequence to be detected). The ability of two polymers of nucleic acid 
containing complementary sequences to find each other and anneal through base pairing 
interaction is a well-recognized phenomenon. The initial observations of the "hybridization" 
process by Marmur and Lane. Proc. Nail. Acad. Sci. USA 46:453 (1960) and Doty et al. 
Proc. Satl. Acad. Sci. USA 46:461 (1960) have been followed by the refinement of this 

process into an essential tool of modem biology. 

The complement of a nucleic acid sequence as used herein refers to an oligonucleotide 
which, when aligned with the nucleic acid sequence such that the S" end of one sequence is 
paired uith the 3' end of the other, is in "antiparallel association." Certain bases not 
commonly found in natural nucleic acids may be included in the nucleic acids of the present 
invention and include, for example, inosine and 7-deazaguanine. Complementarity need not 
be perfect: stable duplexes may contain mismatched base pairs or unmatched bases. Those 
skilled in the art of nucleic acid technology can determine duplex stability empirically 
considering a number of variables including, for example, the length of the oligonucleotide, 
base composition and sequence of the oligonucleotide, ionic strength and incidence of 

mismatched base pairs. 

Stability of a nucleic acid duplex is measured by the melting temperature, or "T„." 
The T„ of a particular nucleic acid duplex under specified conditions is the temperature at 
which^on average half of the base pairs have disassociated. The equation for calculating the 
I T„ of nucleic acids is well known in the art. 

The term "probe" as used herein refers to a labeled oligonucleotide which forms a 
duplex structure with a sequence in another nucleic acid, due to complementarity of at least 
one sequence in the probe with a sequence in the other nucleic acid. 
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The term "label" as used herein refers to any atom or molecule which can be used to 
provide a detectable (preferably quantifiable) signal, and which can be attached to a nucleic 
acid or protein. Labels may provide signals detectable by fluorescence* radioactivitv, 
colorimeiry. gravimetry. X-ray diffraction or absorption, magnetism, enzymatic activity, and 
5 the like. 

The terms "nucleic acid substrate" and nucleic acid template" are used herein 
interchangeably and refer to a nucleic acid molecule which may comprise single- or double- 
stranded DNA or RNA. 

"Oligonucleotide primers matching or complementary to a gene sequence" refers to 
10 oligonucleotide primers capable of facilitating the template-dependent synthesis of single or 
double-stranded nucleic acids. Oligonucleotide primers matching or complementary to a gene 
sequence may be used in PCRs, RT-PCRs and the like. 

A "consensus gene sequence" refers to a gene sequence which is derived by 
comparison of two or more gene sequences and which describes the nucleotides most often 
1 5 present in a given segment of the genes; the consensus sequence is the canonical sequence. 

The term "polymorphic locus" is a locus present in a population which shows variation 
between members of the population (i.e.. the most common allele has a frequency of less than 
0.95). In contrast, a "monomorphic locus" is a genetic locus at little or no variations seen 
between members of the population (generally taken to be a locus at which the most common 
20 allele exceeds a frequency of 0.95 in the gene pool of the population). 

The term "microorganism" as used herein means an organism too small to be observed 
with the unaided eye and includes, but is not limited to bacteria, viruses, protozoans, fungi, 
and ciliates. 

The term "microbial gene sequences" refers to gene sequences derived from a 
25 microorganism. 

The term "bacteria" refers to any bacterial species including eubacterial and 
archaebacterial species. 

The term "virus" refers to obligate, ultramicroscopic, intracellular parasites incapable 
of autonomous replication (i.e.. replication requires the use of the host cell's machinery). 
30 Adenoviruses, as noted above, are double-stranded DNA viruses. The left and right inverted 
terminal repeats (ITRs) are short elements located at the 5' and 3" termini of the linear Ad 
genome, respectively and are required for replication of the viral DNA. The left ITR is 
located between 1-130 bp in the Ad genome (also referred to as 0-0.5 mu). The right ITR is 
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-3.7500 bp to the end of the genome (also referred to as 99.5-100 ™u). The t.o 
,TRs are inverted repeats of each other. For clarity, the left ITR or 5" end is sued define the 
V and 3- ends of the ITRs. The end of the left ITR is located at the extreme 5" end of the 
linear adenoviral genome: picturing the left ITR (LITR) as an arrow extending trom the . 
end of the .enome. the tail of the 5' ITR is located at mu 0 and the head of the left ITR iS 
located at ~ 0.5 mu (further the tail of the left ITR is referred to as the 5" end of the left ITR 
and the head of the left ITR is referred to as the 3" end of the left ITR). The tail of the nght 
or 3- ITR is located at mu 100 and the head of the rigt ITR is located at ~ mu 99.5: the head 
of the rieht ITR is referred to as the 5' end of the right ITR and the tail of the right ITR .s 
referred to as the 3' end of the right ITR (RITR). In the linear Ad genome, the ITRs face 
each other with the head of each ITR pointing inward toward the bulk of the genome. When 
arranged in a "tail to tail orientation" the tails of each ITR (which comprise the 5' end of the 
LITR and the 3" end of the RITR) are located in proximity to one another while the heads of 
each ITR are separated and face outward (see for example, the arrangement of the ITRs m the 
EAM shown in Figure 10 herein). The "adenovirus packaging sequence" refers to the ^ 
sequence which comprises five (AI-AV) packaging signals and is required for encaps.dation 
of the mature linear genome: the packaging signals are located from -194 to 358 bp in the Ad 

genome (about 0.5-1.0 mu). 

The phrase "at least one adenovirus gene coding region" refers to a nucleotide 
sequence containing more than one adenovirus gene coding gene. A "helper adenovirus" or 
"helper virus" refers to an adenovirus which is replication-competem in a particular host cell 
(the host may provide Ad gene products such as El proteins), this replication-competent virus 
is used to supplv in trans functions {e.g., proteins) which are lacking in a second rephcation- 
incompetem virus: the first replication-competent virus is said to "help" the second 
replication-incompetem virus thereby permitting the propagation of the second viral genome 
in the cell containing the helper and second viruses. 

The term "containing a deletion within the E2b region" refers to a deletion of at least 
one basepair (preferably more than one bp and preferably at least 100 and most preferably 
more than 300 bp) within the E2b region of the adenovirus genome. An E2b deletion is a 
deletion that prevents expression of at least one E2b gene product and encompasses deletions 
within exons encoding portions of E2b.specific proteins as well as deletions within promoter 
and leader sequences. 



14- 



wo 98/17783 PCT/US97/19541 

An "adenovirus minichromosome" refers to a linear molecule of DNA containing the 
Ad ITRs on each end which is generated from a plasmid containing the ITRs and one or more 
gene of interest. The term "encapsidated adenovirus minichromosome" or "EAM" refers to an 
adenovirus minichromosome which has been packaged or encapsidated into a viral particle: 
5 plasmids containing the Ad ITRs and the packaging signal are shown herein to produce 

EAMs. When used herein, "recovering" encapsidated adenovirus minichromosomes refers to 
the collection of EAMs from a cell containing an EAM plasmid and a helper virus: this cell 
will direct the encapsidation of the minichromosome to produce EAMs. The EAMs may be 
recovered from these cells by lysis of the cell (e.g., freeze-thawing) and pelleting of the cell 

10 debris to a cell extract as described in Example 1 (Ex. 1 describes the recovery of Ad virus 
from a cell, but the same technique is used to recover EAMs from a cell). "Purifying" such 
minichromosomes refers to the isolation of the recovered EAMs in a more concentrated form 
(relative to the cell lysate) on a density gradient as described in Example 7: purification of 
recovered EAMs permits the physical separation of the EAM from any helper virus (if 

15 present). 

The term "transfection" as used herein refers to the introduction of foreign DNA into 
eukaryotic cells. Transfection may be accomplished by a variety of means known to the art 
including calcium phosphate-DNA co-precipitation, DEAE-dextran-mediated transfection. 
polybrene-mediated transfection, electroporation, microinjection, liposome fusion, lipofection, 
20 protoplast fusion, retroviral infection, and biolistics. 

The term "stable transfection" or "stably transfected" refers to the introduction and 
integration of foreign DNA into the genome of the transfected cell. The term "stable 
transfectant" refers to a cell which has stably integrated foreign DNA into the genomic DNA. 
As used herein, the term "gene of interest" refers to a gene inserted into a vector or 
25 plasmid whose expression is desired in a host cell. Genes of interest include genes having 
therapeutic value as well as reporter genes. A variety of such genes are contemplated, 
including genes of interest encoding a protein which provides a therapeutic function (such as 
the dystrophin gene, which is capable of correcting the defect, seen in the muscle of MD 
patients), the utrophin gene, the CFTR gene (capable of correcting the defect seen in cystitic 
30 fibrosis patients), etc.. 

The term "reporter gene" indicates a gene sequence that encodes a reporter molecule 
(including an enzyme), A "reporter molecule" is detectable in anv detection svstem, 
including, but not limited to enzyme (e.g.. ELISA, as well as enzyme-based histochemical 
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assays), fluorescent, radioactive, and luminescent systems. In one embodiment, the present 
invention contemplates the E. coli p-galactosidase gene (available from Pharmacia Biotech. 
Pistacaiaway. NJ). green fluorescent protein (GFP) (commercially available from Clontech. 
Palo Alto. CA). the human placental alkaline phosphatase gene, the chloramphenicol 
acetyltransferase (CAT) gene: other reporter genes are known to the art and may be 
employed. 

As used herein, the terms "nucleic acid molecule encoding." "DNA sequence 
encodinc." and "DNA encoding" refer to the order or sequence of deoxyribonucleotides along 
a strand of deoxyribonucleic acid. The order of these deoxyribonucleotides determines the 
order of amino acids along the polypeptide (protein) chain. The DNA sequence thus codes 
for the amino acid sequence. 

DESCRIPTION OF THE INVENTION 

The present invention provides improved adenovirus vectors for the delivery of 
recombinam genes to cells in vitro and in vivo. As noted above, the present invention 
contemplates two approaches to improving adenovirus vectors. The first approach generally 
contemplates a recombinant plasmid containing the minimal region of the Ad genome 
required for replication and packaging (i.e.. the left and right ITR and the packaging or 4* 
sequence) along with one or more genes of interest; this recombinam plasmid is packaged into 
an encapsidated adenovirus minichromosome (EAM) when grown in parallel with an El- 
deleted helper virus in a cell line expressing the El proteins {e.g.. 293 cells). The 
recombinant adenoviral minichromosome is preferentially packaged. To prevent the 
packaging of the helper virus, a helper virus containing loxP sequences flanking the T 
sequence is employed in conjunction with 293 cells expressing Cre recombinase: Cre-loxP 
mediated recombination removes the packaging sequence form the helper genome thereby 
preventing packaging of the helper during the production of EAMs. In the second approach, 
"damaged" or "deleted" adenoviruses containing deletions within the E2b region are 
employed. While the "damaged" adenovirus is capable of self-propagation in a packaging cell 
line expressing the appropriate E2b protein(s), the E2b-deleted recombinant adenovirus are 
incapable of replicating and expressing late viral gene products outside of the packaging cell 
line. 

In one embodiment, the self-propagating recombinant adenoviruses containing 
deletions in the E2b region of the adenovirus genome. In another embodiment, "gutted" 



16- 



I 



wo 98/17783 PCT/US97/19541 

viruses are contemplated: these viruses lack all viral coding regions. In addition, packaging 

cell lines co-expressing El and E2b gene products are provided which allow the production of 
infectious recombinant virus containing deletions in the El and E2b regions without the use 
of helper virus. 

5 The Description of the Invention is divided into the following sections: I. Self- 

Propagating Adenovirus Vectors: II. Packaging Cell Lines: and III, Encapsidated Adenoviral 
Minichromosomes. 

I. Self-Propagating Adenovirus Vectors 

'0 Self-propagating adenovirus (Ad) vectors have been extensively utilized to deliver 

foreign genes to a great variety of cell types in vitro and in vivo, "Self-propagating viruses" 
are those which can be produced by transfection of a single piece of DNA (the recombinant 
viral genome) into a single packaging cell line to produce infectious virus: self-propagating 
viruses do not require the use of helper virus for propagation. 

Existing Ad vectors have been shown to be problematic in vivo. This is due in part 
because current or first generation Ad vectors are deleted for only the early region 1 (El) 
genes. These vectors are crippled in their ability to replicate normally without the trans- 
complementation of El functions provided by human 293 cells, a packaging cell line [ATCC 
CRL 1573; Graham et ai (1977) J. Gen. Virol. 36:59]. Unfortunately, with the use of high 

20 titres of El deleted vectors, and the fact that there are El -like factors present in many cell 
types, E 1 deleted vectors can overcome the block to replication and express other viral gene 
products [Imperiale et al. (1984) Mol. Cell Biol. 4:867: Nevins (1981) Cell 26:213: and 
Gaynor and Berk (1983) Cell 33:683]. The expression of viral proteins in the infected target 
cells elicits a swift host immune response, that is largely T-cell mediated [Yang and Wilson 

25 (1995) J. Immunol. 155:2564 and Yang et ai (1994) Proc. Natl. Acad. Sci. USA 91:4407]. 

The transduced cells are subsequently eliminated, along with the transferred foreign gene. In 
immuno-incompetent animals. Ad delivered genes can be expressed for periods of up to one 
year [Yang et ai (1994), supra: Vincent et al. (1993) Nature Genetics 5:130: and Yang et ai 
(1995) Proc. Natl. Acad. Sci. USA 92:7257]. 

^0 Another shortcoming of first generation Ad vectors is that a single recombination 

event between the genome of an Ad vector and the integrated El sequences present in 293 
• cells can generate replication competent Ad (RCA), which can readily contaminate viral 
stocks. 
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In order to further cripple viral protein expression, and also to decrease the frequency 
of generating RCA. Uie present invention provides Ad vectors containing deletions in the E2b 
region. Propagation of these E2b-deleted Ad vectors requires cell lines which express the 
deleted E2b gene products. The present invention provides such packaging cell lines and for 
the first time demonstrates that the E2b gene products. DNA polymerase and preterminal 
protein, can be constitutively expressed in 293 cells along with the El gene products. With 
every gene that can be constitutively expressed in 293 cells comes the opportunity to generate 
new versions of Ad vectors deleted for the respective genes. This has immediate benefits; 
increased carrying capacity, since the combined coding sequences of the polymerase and 
preterminal proteins that can be theoretically deleted approaches 4.6 kb and a decreased 
incidence of RCA generation, since two or more independent recombinadon events would be 
required to generate RCA. Therefore, the novel El. Ad polymerase and preterminal protein 
expressing cell lines of the present invention enable the propagation of Ad vectors witii a 
carrying capacity approaching 13 kb. witiiout the need for a contaminating helper virus 
[Mitani et al. (1995) Proc. Natl. Acad. Sci. USA 92:3854]. In addition, when genes critical 
to the viral life cycle are deleted (e.g., the E2b genes), a furtiier crippling of Ad to replicate 
and express other viral gene proteins occurs. This decreases immune recognition of virally 
infected cells, and allows for extended durations of foreign gene expression. The most 
important attribute of El, polymerase, and preterminal protein deleted vectors, however, is 
their inability to express the respective proteins, as well as a predicted lack of expression of 
most of the viral strucmral proteins. For example, the major late promoter (MLP) of Ad is 
responsible for transcription of the late structural proteins LI through L5 [Doerfler. In 
Adanovirus DNA. The Viral Genome and Its Expression (Martinus Nijhoff Publishing Boston. 
1986)]. Though the MLP is minimally active prior to Ad genome replication, the rest of the 
late genes get transcribed and translated from the MLP only after viral genome replication has 
occurred [Thomas and Mathews (1980) Cell 22:523]. This cis-dependem activation of late 
gene transcription is a feature of DNA viruses in general, such as in the growth of polyoma 
and SV-40. Tlie polymerase and preterminal proteins are absolutely required for Ad 
replication (unlike the E4 or protein IX proteins) and thus their deletion is extremely 
detrimental to Ad vector late gene expression. 
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II. Packaging Cell Lines Constitutively Expressing E2b Gene 

Products 

The present invention addresses the Hmitations of current or first generation Ad 
vectors by isolating novel 293 cell lines coexpressing critical viral gene functions. The 
present invention describes the isolation and characterization of 293 cell lines capable of 
constiiuti\ely expressing the Ad polymerase protein. In addition, the present invention 
describes the isolation of 293 cells which not only express the E 1 and polymerase proteins, 
but also the Ad-preterminal protein. The isolation of cell lines coexpressing the El, Ad 
polymerase and preterminal proteins demonstrates that three genes critical to the life cycle of 
Ad can be constitutively coexpressed, without toxicity. 

In order to delete critical genes from self-propagating Ad vectors, the proteins encoded 
by the targeted genes have to first be coexpressed in 293 cells along with the E 1 proteins. 
Therefore, only those proteins which are non-toxic when coexpressed constitutive!}' (or toxic 
proteins inducibly-expressed) can be utilized. Coexpression in 293 cells of the El and E4 
genes has been demonstrated (utilizing inducible, not constitutive, promoters) [Yeh et ai 
(1996) J. Virol. 70:559; Wang et al (1995) Gene Therapy 2:775; and Gorziglia ei ai (1996) 
J. Virol. 70:4173], The El and protein IX genes (a virion structural protein) have been 
coexpressed [Caravokyri and Leppard (1995) J. Virol. 69:6627], and coexpression of the El, 
E4. and protein IX genes has also been described [Krougliak and Graham (1995) Hum. Gene 
Ther. 6:1575], 

The present invention provides for the first time, cell lines coexpressing E 1 and E2b 
gene products. The E2b region encodes the viral replication proteins which are absolutely 
required for Ad genome replication [Doerfler, supra and Pronk et al. (1992) Chromosoma 
102:S39-S45]. The present invention provides 293 cells which constitutively express the 140 
kD Ad-polymerase. While other researchers have reported the isolation of 293 cells which 
express the Ad-preterminal protein utilizing an inducible promoter [Schaack et ai (1995) J. 
Virol. 69:4079], the present invention is the first to demonstrate the high-level, constitutive 
coexpression of the El. polymerase, and preterminal proteins in 293 cells, without toxicity. 
These novel cell lines permit the propagation of novel Ad vectors deleted for the El, 
polymerase, and preterminal proteins. 
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III. Encapsidated Adenoviral Minichromosomes 

The present invention also provides encapsidated adenovirus minichromosome (EAM) 
consisting of an infectious encapsidated linear genome containing Ad origins of replication, 
packaging signal elements, a reporter gene (e.g.. a (J-galactosidase reporter gene cassette) and 
a gene of imerest {e.g., a full length (14 kb) dystrophin cDNA regulated by a muscle specific 
enhancer/promoter). EAMs are generated by cotransfecting 293 cells with supercoiled 
plasmid DNA (e.g.. pAdSpdys) containing an embedded inverted origin of replication (and the 
remaining above elements) together with linear DNA from El -deleted virions expressing 
human placental alkaline phosphatase (hpAP) (a helper virus). All proteins necessary for the 
generation of EAMs are provided in irons from the hpAP virions and the two can be 
separated from each other on equilibrium CsCl gradients. These EAMs are useful for gene 
transfer to a variety of cell types both in vitro and in vivo. 

Adenovirus-mediated gene transfer to muscle is a promising technology for gene 
therapy of Duchemie muscular dystrophy (DMD). However, currently available recombinant 
adenovirus vectors have several limitations, including a limited cloning capacity of -8.5 kb. 
and the induction of a host immune response that leads to transient gene expression of 3 to 4 
weeks in immunocompetem animals. Gene therapy for DMD could benefit from the 
development of adenoviral vectors with an increased cloning capacity to accommodate a full 
length (-14 kb) dystrophin cDNA. This increased capacity should also accommodate gene 
regulatory elements to achieve expression of transduced genes in a tissue-specific manner. 
Additional vector modifications that eliminate adenoviral genes, expression of which is 
associated with development of a host immune response, might greatly increase long term 
expression of virally delivered genes in vivo. The constructed encapsidated adenovirus 
minichromosomes of the present invention are capable of delivering up to 35 kb of non-viral 
exogenous DNA. These minichromosomes are derived from bacterial plasmids containing 
two fused inverted adenovirus origins of replication embedded in a circular genome, the 
adenovirus packaging signals, a Pgalactosidase reporter gene and a full length dystrophin 
cDNA regulated by a muscle specific enhancer/promoter. Tl« encapsidated minichromosomes 
are propagated in vitro by o am-complementation with a replication defective (E1+E3 deleted) 
helper virus. These minichromosomes can be propagated to high titer (>10Vml) and purified 
on CsCl gradients due to their buoyancy difference relative to helper virus. These vectors are 
able to transduce myogenic cell cultures and express dystrophin in myotubes. These results 
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* 

demonstrate that encapsidated adenovirus minichromosomes are useful for cene transfer to 
muscle and other tissues. 

The present invention further provides methods for modifying the above-described 
EAM s> stem to enable the generation of high titer stocks of EAMs with minimal helper virus 
5 contamination. Preferably the EAM stocks contain helper virus representing less than 1%, 
preferably less than 0.1% and most preferably less than 0.01% (including 0.0%) of the final 
viral isolate. 

The amount of helper virus present in the EAM preparations is reduced in two ways. 
The first is by selectively controlling the relative packaging efficiency of the helper virus 
10 versus the EAM virus. The Cre-loxP excision method is employed to remove the packaging 
signals from the helper virus thereby preventing the packaging of the helper virus used to 
provide in trans viral proteins for the encapsidation of the recombinant adenovirus 
minichromosomes. The second approach to reducing or eliminating helper virus in EAM 
stocks is the use of improve physical methods for separating EAM from helper virus. 

15 

EXPERIMENTAL 

The following examples serve to illustrate certain preferred embodiments and aspects 
of the present invention and are not to be construed as limiting the scope thereof. 

In the experimental disclosure which follows, the following abbreviations apply: M 

20 (molar): mM (millimolar): |aM (micromolar); mol (moles); mmol (millimoles); jimol 
(micromoles); nmol (nanomoles): mu or m.u. (map unit); g (gravity); gm (grams): mg 
(milligrams): (micrograms): pg (picograms); L (liters); ml (milliliters); |il (microliters); cm 
(centimeters): mm (millimeters); ^m (micrometers); nm (nanometers): hr (hour); min 
(minute): msec (millisecond); 'C (degrees Centigrade); AMP (adenosine 5' -monophosphate); 

25 cDNA (copy or complimentary DNA); DTT (dithiotheritol); ddH.O (double distilled water); 
dNTP (deoxyribonucleotide triphosphate); rNTP (ribonucleotide triphosphate); ddNTP 
(dideoxyribonucleotide triphosphate); bp (base pair); kb (kilo base pair); TLC (thin layer 
chromatography); tRNA (transfer RNA); nt (nucleotide); VRC (vanadyl ribonucleoside 
complex): RNase (ribonuclease); DNase (deoxyribonuclease); poly A (polyriboadenylic acid); 

30 PBS (phosphate buffered saline); OD (optical density); HEPES (N-[2. 

Hydroxyethyl]piperazine-N-[2-ethanesulfonic acid]): HBS (HEPES buffered saline); SDS 
(sodium dodecyl sulfate); Tris-HCl (tris[Hydroxymethyl]aminomethane-hydrochloride); rpm 
(revolutions per minute); ligation buffer (50 mM Tris-HCl, 10 mM MgCK. 10 mM 

-21 - 



PCTrtJS97/19541 

WO 98/17783 

dithiothreitol. 25 |ig/ml bovine semm albumin, and 26 ^iM NAD+. and pH 7.8): EGTA 
(ethylene glycol-bis(|J-aminoethyl ether) N. N. N". N*-tetraacetic acid): EDTA 
(ethylenediaminetetracetic acid); ELISA (enzyme linked immunosorbant assay): LB (Luria- 
Bertani broth: 10 g tr>'ptone. 5 g yeast extract, and 10 g NaCl per liter. pH adjusted to 7.5 
with IN NaOH); superbroth (12 g tryptone. 24 g yeast extract. 5 g glycerol. 3.8 g KH.PO^ 
and 12.5 g. K.,HP04 per liter); DMEM (Dulbecco's modified Eagle s medium): ABI (Applied 
Biosvstems Inc.. Foster City. CA): Amersham (Amersham Corporation. Arlington Heights. 
IL): ATCC (American Type Culture Collection. Rockville, MY): Beckman (Beckman 
Instruments Inc.. Fullerton CA); BM (Boehringer Mannheim Biochemicals, Indianapolis, IN); 
Bio-101 (Bio-101. Vista, CA); BioRad (BioRad, Richmond, CA); Brinkmann (Brinkmann 
Instruments Inc. Wesbury. NY); BRL. Gibco BRL and Life Technologies (Bethesda Research 
Laboratories. Life Technologies Inc.. Gaithersburg. MD): CRI (Collaborative Research Inc. 
Bedford. MA): Eastman Kodak (Eastman Kodak Co.. Rochester. NY): Eppendorf (Eppendorf. 
Eppendorf North America. Inc.. Madison, WI); Falcon (Becton Dickenson Labware. Lincoln 
Park, NJ): IBI (International Biotechnologies, Inc., New Haven. CT): ICN (ICN Biomedicals. 
Inc.. Costa Mesa. CA); Invitrogen (Invitrogen. San Diego, CA); New Brunswick (New 
Brunswick Scientific Co. Inc.. Edison. NJ); NEB (New England BioLabs Inc.. Beverly, MA); 
NEN (Du Pont NEN Products. Boston. MA); Pharmacia (Pharmacia LKB Gaithersburg, MD); 
Promega (Promega Corporation. Madison. WI); Stratagene (Stratagene Cloning Systems, La 
Jolla, CA): UVP (UVP. Inc.. San Gabreil. CA); USB (United States Biochemical Corp.. 
Cleveland. OH); and Whatman (Whatman Lab. Products Inc. Clifton. NJ). 

Unless otherwise indicated, all restriction enzymes and DNA modifying enzymes were 
obtained from New England Biolabs (NEB) and used according to the manufacturers 
directions. 

m 

EXAMPLE 1 

Generation Of Packaging Cell Lines That Coexpress 
The Adenovirus El And DNA Polymerase Proteins 

0 In this example, packaging cell lines coexpressing Ad El and polymerase proteins 

were described. These cell lines were shown to support the replication and growth of H5ts36, 
an Ad with a temperature-sensitive mutation of the Ad polymerase protein. These 
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polymerase-expressing packaging cell lines can be used to prepare Ad, vectors deleted for the 
El and polymerase functions. 



a) Tissue Culture And Virus Groivth 

LP-293 cells (Microbix Biosystems. Toronto) were grown and serially passaged as 
suggested by the supplier. 

Plaque assays were performed in 60 mm dishes containing cell monolayers at - 90% 
confluency. The appropriate virus dilution in a 2% DMEM solution was dripped onto the 
cells, and the plates incubated at the appropriate temperature for one hour. The virus 
containing media was aspirated, the monolayer was overlaid with 10 mis of a pre-warmed 
EMEM agar overlay solution (0.8% Noble agar. 4% fetal calf serum, and antibiotics) and 
allowed to solidify. After the appropriate incubation time (usually 7 days for incubations at 
38.5°C and 10-12 days for incubations at 32°C), five mis of the agar-containing solution 
containing 1.3% neutral red was overlaid onto the infected dishes and plaques were counted 
the next day. An aliquot of the virus H5ts36 [Freimuth and Ginsberg (1986) Proc. Natl. 
Acad. Sci. USA 83:7816] was utilized to produce high titre stocks after infection of 293 cells 
at32^C. H5ts36 is an Ad5-derived virus defective for viral replication at the nonpermissive 
temperature [Miller and Williams (1987) J. ViroL 61:3630]. 

The infected cells were harvested after the onset of extensive cytopathic effect, 
pelleted by centrifugation and resuspended in 10 mM Tris-CK pH 8.0. The lysate was freeze- 
thawed three times and centrifuged to remove the cell debris. The cleared lysate was applied 
to CsCK step gradients (heavy CsCl at density of 1.45 g/ml. the light CsCl at density of 1.20 
g/ml). uliracentrifuged. and purified using standard techniques [Graham and Prevec (1991) In 
Methods in Molecular Biology^ Vol 7: Gene Transfer and Expression Protocols. Murray 
(ed.), Humana Press, Clifton, NJ, pp. 109-128]. The concentration of plaque forming units 
(pfu) of this stock was determined at 32^*0 as described above. Virion DNA was extracted 
from the high titre stock by pronase digestion, phenol-chloroform extraction, and ethanol 
precipitation. The leakiness of this stock was found to be <1 in 2000 pfu at the non- 
permissive temperature, consistent with previous reports [Miller and Williams (1987), supra], 

b) Plasmids 

The expression plasmid pRSV-pol [Zhao and Padmanabhan (1988) Cell 55:1005] 
contains sequences encoding the Ad2 polymerase mRNA (including the start codon from the 
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exon at map unit 39) under the transcriptional control of the Rous Sarcoma Virus 
LTR/promoter element: the Ad2 DNA polymerase sequences are flanked on the 3" end by the 
SV-40 small t intron and SV-40 polyadenylation addition site (see Fig. lA). Figure lA 
provides a schematic representation of the Ad polymerase expression plasmid pRSV-pol. 
pRSV-pol includes the initiator methionine and amino-lerminal peptides encoded by the exon 
at m.u. 39 of the Ad genome. The location of the PGR primers p602a and p2158c. the two 
Seal 1 kb probes utilized for Northern analyses, the polymerase terminator codon. and the 
polyadenylation site of the IVa2 gene (at 11.2 m.u. of the Ad genome) are indicated. 

The expression plasmid pRSV-pTP [Zhao and Padmanabhan (1988). supra] contains 
sequences encoding the Ad2 preterminal protein (including the amino terminal peptides 
encoded by the exon at map unit 39 of the Ad genome) under the transcriptional control of 
the Rous Sarcoma Virus LTR/promoter element; the pTP sequences are flanked on the 3' end 
bv the SV-40 small-t intron and SV-40 polyadenylation signals (see Figure IB for a schematic 
of pRSV-PTP). Figure IB also shows the location of the £toRV subfragment utilized as a 
probe in the genomic DNA and cellular RNA evaluations, as well as the initiator methionine 
codon present from map unit 39 in the Ad5 genome. The locations of the H5ml90 and 
HSsublQO insertions are shown relative to the preterminal protein open-reading frame. The 
following abbreviations are used in Figure 1 : ORF. open reading frame: small-t. small tumor 

antigen, and m.u., map unit. 

pCEP4 is a plasmid containing a hygromycin expression cassette (Invitrogen). 

pFG140 (Microbix Biosystems Inc. Toronto, Ontario) is a plasmid containing 
sequences derived from Ad5i//309 which contain a deletion/substitution in the E3 region 
[Jones and Shek (1979) Cell 17:683]. pFGHO is infectious in single transfections of 293 
cells and is used as a control for transfection efficiency. 



c) Transfection Of 293 Cells 

LP-293 cells were cotransfected with BamUl linearized pRSV-pol and BamHl 
linearized pCEP4 at a molar ratio of 10:1 using a standard CaPOj precipitation method 
[Sambrook et al. (1989) Molecular Cloning: A Laboratory Manual. Cold Spring Harbor 
Press. Plainview. NY. pp. 16.33- 16.36]. In addition. 293 cells were cotransfected with or with 
BamWl linearized pRSV-pol. BamWX linearized pRSV-pTP and pCEP using a molar ratio of 
10:1 (non-selectable:selectable plasmids). 
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Fortv-eight hours after transfection. the cells were passaged into media containinu 
liygrom\ cin at 100 |ag/mL. Individual hygromycin resistant colonies were isolated and 
expanded. 

d) Isolation Of Ad Polymerase Expressing 293 Cell Lines 

Twenty hygromycin resistant cell lines were expanded and screened for the ability to 
express the Ad polymerase protein. Initially, the individual cell lines were assayed for the 
ability to support growth of the viral polymerase mutant H5ts36 using the plaque assay 
described in section a) above. It was speculated that constitutive expression of the wild type 
Ad polymerase protein in a clonal population of 293 cells should allow the growth of H5ts36 
at 38.5°C. However, it was unclear if constitutive expression of the Ad polymerase would be 
toxic when coexpressed with the El proteins present in 293 cells. Similar toxicity problems 
have been observed with the Ad ssDBP. and the pTP [Klessig et ai (1984) Mol. Cell. Biol. 
4:1354 and Schaack et ai (1995) J. Virol. 69:4079]. 

Of the twenty hygromycin resistant cell lines isolated, seven were able to support 
plaque formation with H5ts36 at the non-permissive temperature, unlike the parental LP-293 
cells; these cell lines were named B-6. B-9, C-1, C-4, C-7. C-13 and C-14 (see Table 1) (the 
B-6 and B-9 cell lines received only the pRSV-pol and CEP4 plasmids; C-1. C-4. C-7, C-1 3 
and C-14 cell lines received the pRSV-pol, pRSV-pTP and CEP4 plasmids). For the results 
20 shown in Table 1 . dishes (60 mm) of near confluent cells of each cell line were infected with 
the same dilution of H5ts36 at the temperature indicated, overlaid with agar media, and 
stained for plaques as outlined in section a. Passage number refers to the number of serial 
passages after initial transfection with the plasmid pRSV-pol. 

The cell line B-6 produced plaques one day earlier than the other Ad polymerase- 
25 expressing cell lines, which may reflect increased polymerase expression (see below). Cell 
line B-9 demonstrated an increased doubling time whereas each of the other cell lines 
displayed no growth disadvantages relative to the parental LP-293 cells. As shown in Table 
1. even after multiple passages (in some instances up to four months of serial passaging) the 
cells were still capable of H5ts36 plaque formation at the non-permissive temperature, 
30 indicating that the RSV-LTR/promoter remained active for extended periods of time. 

However, the cell lines B-9 and C-1 3 displayed a decreased ability to plaque the virus at.32**C 
as well as at 38.5°C. suggesting that a global viral complementation defect had occurred in 
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these cell lines after extended passaging. The remaining cell lines screened at later passages 
demonstrated no such defect, even after 20 passages {e.g.. cell line B-6. Table 1). 



TABLE 1 



Plaquing Ability Of H5ts36 At The Non-Permissive Temperature Utilizing 293-Ad Polymerase Expressing 

Cell Lines 



Number Of Plaques At: 



Cell Line 


Passage Number 






293 ; 1 


^ /p /\/\ 

>500 


A 
U 


9 1 


>50u 






20 1 


>500 






5 I 


>500 




8-9 [ 


14 1 


90 


1 Q 

18 




5 j 


>500 


>500 


C-l 


n 


>500 


>500 




5 1 


>500 


>500 


C-4 


14 


1 >500 


>500 




r 5 


I >500 


>500 


C-7 


I 14 


>500 


>500 




F 5 


[ >500 


>500 


C-13 


27 


120 


4 




F 5 


1 >500 


>500 


C-14 


1 27 


1 >500 


370 



e) Genomic Analysis Of Ad Polymerase-Expressing Cell Lines 

Genomic DNA from LP-293 cells and each of the seven cell lines able to complement 
H5ts36 at 38.5°C were analyzed by PGR for the presence of pRSV-pol derived sequences. 
Genomic DNA from LP-293 cells and the hygromycin resistant cell lines were harvested 
using standard protocols (Sambrook et aL supra) and 200 ng of DNA from each cell line was 
analyzed bv PGR in a solution containing 2 ng/mL of primers p602a and p2158c. 10 mM 
TrisHCl. pH 8.3. 50 mM KGl. 1.5 mM MgGU, and 0.001% gelatin. The forward primer. 
p602a [5-.TTGATTTTATGTTTCAGGTTC AGGG-3' (SEQ ID N0:2)] is located in the SV- 
40 polyadenylation sequence. The reverse primer p2158c [5'- 

TTACCGGCACACTCGGAGGG-3' (SEQ ID N0:3)] is Ad-sequence specific with the 5' 
nucleotide located at position 3394 of the Ad 5 genome [numbering according to Doerfler 
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(1986) Adenovirus D\A, The Viral Genome and Its Expression, Nijhoff. Boston. MA, pp. 1- 
95]. 

PCR was performed with a Perkin Elmer 9600 Thermocycler utilizing the following 
cycling parameters: initial denaiuration at 94°C for 3 min. 3 cycles of denaturation at 94°C 
5 for 30 sec. annealing at 50°C for 30 sec, and extension at ll^'C for 60 sec, followed by 

another 27 cycles with an increased annealing temperature at 56°C, with a final extension at 
72*'C for 10 minutes. PCR products were separated on a 1.0 % agarose gel and visualized 
with eihidium bromide staining (Figure 2). A I kb ladder (Gibco-BRL) was used as a size 
marker, and the plasmid pRSV-pol was used as a positive control. The -750 bp PCR products 
10 are indicated by an arrow in Figure 2. 

As shown in Figure 2, all cell lines capable of H5ts36 plaque formation at 38.5°C 
contained the Ad pol DNA sequences, whereas the LP-293 cells did not yield any 
amplification product with these primers. This result demonstrates that each of the selected 
cell lines stably co-integrated not only the hygromycin resistance plasmid pCEP4. but also 
15 pRSV-pol. 

0 Complementation Of The Replication Defect Of H5ts36 By 
Ad Polymerase Expressing Cell Lines 

The C to T transition at position 7623 of the H5ts36 genome alters the DNA binding 
20 affinity of the Ad polymerase protein, rendering it defective for viral replication at non- 
permissive temperatures [Chen et ciL (1994) Virology 205:364; Miller and Williams (1987) J. 
Virol. 61:3630: and Wilkie et cii (1973) Virology 51:499]. To analyze the functional activity 
of the Ad polymerase protein expressed by each of the packaging cell lines, a viral 
replication-complementation assay was performed. LP-293 cells or the hygromycin resistant 
25 cell lines were seeded onto 60 mM dishes at densities of 2.5-3.0 x 10* per dish, infected with 
H5ts36 at a muhiplicity of infection (MOI) of 10, and incubated for 24 hours at 38.5°C, or 48 
hours at 32**C. Total DNA was harvested from each plate, then 2 jig of each sample were 
digested with //mdllL separated on a 1 .0% agarose gel, transferred to a nylon membrane, and 
hybridized with ^-P-labeled H5ts36 virion DNA. Densitometric analysis of the 8.010 bp 
30 ///wdlll fragment in each lane was performed on a phophoroimager (Molecular Dynamics) 
utilizing a gel image processing system (IP Lab Version 1.5, Sunnyvale, CA). 
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The resuUing autoradiograph is shown in Figure 3. In Figure 3. the lane marked 
"Std." (standard) contains 1 ^g of HmdIII-digested H5ts36 virion DNA. The 8.010 bp 
Hindin fragments analyzed by densitometry are indicated by an anow. 

As shown in Figure 3. H5ts36 had a diminished ability to replicate in LP-293 cells at 
the non-permissive temperature. In contrast, all seven of the previously selected cell lines 
were able to support replication of H5ts36 virion DNA at SS.S^C to levels approaching those 

occurring in LP-293 cells at 32°C. 

A densitometric analysis of the amoum of H5ts36 viral DNA replicated in each of the 
cell lines at permissive and nonpermissive temperatures is presented in Table 2 below. For 
this assay, the relative amounts of the 8.010 bp ffi«dlll fragment were compared. The 
relative levels of H5ts36 virion DNA replication determined by densitometric analysis of the 
8.010 bp HMin fragment isolated from each of the cell line DNA samples. The surface area 
of the 8.010 bp fragmem in 293 cells incubated at 38.5»C was designated as I. and includes 
some replicated H5ts36 virion DNA. The numbers in each column represem the ratio 
between the densities of the 8.010 bp fragment isolated in the indicated cell line and the 
density of the same band present in LP-293 cells at 38.5°C. 

As shown in Table 2. the levels of replication at the permissive temperature were all 
within four-fold of each other, regardless of which cell line was analyzed, but at the non- 
permissive temperature LP-293 cells reveal the H5ts36 replication defect. The viral bands 
that were present in the LP-293 DNA sample at 38.C represented input virion DNA as well as 
low level replication of H5ts36 DNA. which is generated due to the leakiness of the ts 
mutation at the high MOI utilized in this experiment. The Ad polymerase-expressing cell 
lines were all found to be capable of augmenting 
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TABLE 2 

Densitometric Anal ysis Of H5ts36 Replication 



Cell Line 


Ratios Of 8.010 bp HindlU Fragment Generated At: 


32.0*'C 


38.5°C 


LP-293 


42.6 


1.0 


B-6 


27.3 


57.2 




69.0 


15.8 


C-I 


113.6 


34.1 


C-4 


100.2 


65.5 


C.7 


114.9 


75.0 


C-13 


43.1 


42.2 


C.|4 


46.7 


27.8 



H5ls36 genome replication. Although one cell line (B-9) allowed H5ts36 replication to levels 
only 1 6 fold greater than LP-293 cells, this was the same cell line that was obser\'ed to 
display poor growth properties. Each of the remaining Ad polymerase-expressing cell lines 
allowed substantially greater replication of H5ts36 at non-permissive temperatures, compared 
to LP-293 cells (Table 2). An enhancement of replication up to 75 fold above that of LP-293 
cells was observed with the cell line C-7 at 38.5°C. A substantially more rapid onset of viral 
cytopathic effect in Ad polymerase-expressing cell lines was observed at either temperature. . 
These estimates of H5ts36 replication-complementation are conservative, since they have not 
been adjusted for the low level replication of H5ts36 at j&.5°C [Miller and Williams (1987), 
supra]. The leakiness of the H5ts36 mutation could potentially be overcome with the use of a 
virus deleted for the Ad polymerase gene. 

g) RNA Analysis Of Ad Poiymerase-Expressing Cell Lines 

Total RNA was extracted from each of the cell lines using the RNAzol method 
(Teltest. Inc., Friendswood, TX 77546; Chomczynski and Sacchi (1987) Anal. Biochem. 
162:156]. Fifteen micrograms of RNA from each cell line was electrophoresed on a 0.8% 
agarose- formaldehyde gel, and transferred to a Nytran membrane (Schleicher & Schuell) by 
blotting. The filter was UV crosslinked, and analyzed by probing with the two '-P-labeled 1 
kb Seal subfragments of Ad which span positions 6095-8105 of the Ad5 genome (see Figure 
1 A). These two Seal subfragments of the Ad genome are complimentary to the 5' end of the 
Ad polymerase mRNA. The resulting autoradiograph is shown in Figure 4. In Figure 4. the 
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location of the smaller and larger species of Ad polymerase mRNA are indicated relative to 
the 28S and 18S ribosomal RNAs. The aberrant transcript expressed by the B-9 cell line is 

indicated by an arrow. 

The results shown in Figure 4 revealed that the RNA derived from cell lines B-6 and 
C-7 contained two species of RNA. estimated to be -4800 and -7000 nt in length, while LP-293 
derived RNA had no detectable hybridization signal. The presence of two polymerase RNA 
species suggests that the polyadenylation signal of the Ad IVa2 gene (present in the Ad-pol 
construct, see Fig. lA) is being utilized by the cell RNA processing machinery, in addition to 
the SV.40 polyadenylation signal. Similar analysis of RNA derived from the cell lines C-l. 
C-4. C-l 3. and C-l 4 also detected the same two transcripts as those detected in the RNA of 
cell lines B-6 and C-7. but at decreased levels, suggesting that even low levels of Ad 
polymerase mRNA expression can allow for the efficiem replication of polymerase mutants 
such as H5ts36. The cell line B-6 expressed high levels of polymerase transcript and can 
plaque H5ts36 one day earlier than the other cell lines at 38.5°C, suggesting a causal 
relationship. It is interesting to note that the two polymerase transcripts are also detected in 
RNA isolated from cell line B-9, but substantial amounts of a larger RNA transcript (size > 
10 kb) is also present (see Figure 4). The high level production of the aberrant message may 
be related to the increased doubling time previously noted in this cell line. 

h) Transfectability Of Ad Polymerase-Expressing Cell Lines 

The ability of Ad polymerase-expressing 293 cell lines to support production of 
H5ts36 virions after transfection with H5ts36 genome DNA was examined as follows. 293 
cells as well as hygromycin resistant cell lines were grown to near confluency on 60 mm 
dishes and transfected with either 3 \ig of purified H5ts36 virion DNA, or with 3.5 ^lg of the 
plasmid pFGHO (Microbix Biosystems), using the cationic lipid Lipofectamine (Gibco-BRL). 
Cells that received the H5ts36 virion DNA were incubated at 32°C for 14 days, or 38.5°C for 
10 days. The pFG140 transfected cells were incubated at 37.5''C for 10 days. All plates 
were then stained with the neutral red agar overiay and plaques were counted the next day. 
The results are shown in Table 3. 
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TABLE 3 

Transfection Efficiency Of Ad pol- Ex pressing Cell Lines 



5 



10 



Cell line 


- » — 

Number Of Plaques At: 




38.5*^0 




>500 


0 


D-O 


>500 


>500 


B-9 


n.d." 


n.d. 


C-1 


n.d. 


>500 


C.4 


n.d. 


>500 


C-1 


n.d. 


>500 


C-13 


n.d. 


100 


C-14 


n.d. 


>500 



n.d. = not determined. 



'5 The results shown in Table 3 demonstrated that transfection of H5ts36 DNA at the 

non-permissive temperature allows for ample plaque production in all of the Ad polymerase- 
expressing cell lines tested, unlike the parental LP-293 cells. Cell line C-13 was at passage 
number 29, and demonstrated a somewhat decreased ability to generate plaques at this 
extended passage number. These same cell lines are also capable of producing plaques when 

20 transfected with the plasmid pFGHO, a plasmid capable of producing infectious, El 

dependent Ad upon transfection of the parental 293 cells [Ghosh-Choudhury (1986) Gene 
50:161]. These observations suggest that the Ad polymerase expressing cell lines should be 
useful for the production of second generation Ad vectors deleted not only for the El genes, 
but also for the polymerase gene. As shown below in Examples 2 and 3, this is indeed the 

25 case. 
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EXAMPLE 2 

Isolation And Characterization Of Packaging Cell Lines That 
Coexpress The Adenovirus EK DNA Polymerase And Preterminal Proteins 

In Example K packaging cell lines coexpressing Ad El and polymerase proteins were 
described. These cell lines were shown to support the replication and growth of H5ts36, an 
Ad with a temperature-sensitive mutation of the Ad polymerase protein. These polymerase- 
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expressing packaging cell lines can be used to prepare Ad vectors deleted for the El and 
polymerase functions. In this example. 293 cells cotransfected with both Ad polymerase and 
preterminal protein expression plasmids are characterized. Cell lines co-expressing the Ad El. 
polymerase and preterminal proteins can be used to prepare Ad vectors deleted for the El. 
polymerase and preterminal protein (pTP) functions. 

a) Tissue Culture And Virus Propagation 

The use of LP-293 cells (Microbix Biosystems Inc.. Toronto). Ad-polymerase 
expressing cell lines, and plaquing efficiency assays of Ad viruses was conducted as described 
in E.xample 1. All cells were maintained in 10% fetal bovine serum supplemented DMEM 
media (GIBCO) in the presence of antibiotics. The virus HSsubm [Freimuth and Ginsberg 
(1986) Proc. Natl. Acad. Sci. USA 83:7816] has a temperature sensitive (ts) mutation caused 
by a tliree base pair insertion within the amino terminus of the preterminal protein, in addition 
to a deletion of the El sequences (see Figure IB). HSsublOO was propagated and titred at 
32.0''C in LP-293 cells: the leakiness of this stock was less than 1 per 1000 plaque-forming 
units (pfu) at the nonpermissive temperamre of 38.5°C. A lower titer cell lysate contaning 
the virus H5/«190 (which contains a 12 base-pair insertion within the carboxy-terminus of the 
preterminal protein as well as a deletion of the El region, see Figure IB) was provided by Dr. 
P. Freimuth (Freimuth and Ginsberg (1986), supra]. The polymerase and preterminal protein 
expressing cell lines were always maintained in media supplemented with hygromycin 
(Sigma) at 100 ng/mL. 

b) Isolation Of Ad Polymerase And Preterminal Protein 
Expressing 293 Cells 

The C-l. C-4. C-7. C-13, and C-14 cell lines (Ex. 1), which had been cotransfected 
with pRSV-pol. pRSV-pTP and CEP4. were screened for presence of pTP sequences and for 
the ability to support the growth of H5ts36 (ts for the Ad-polymerase), H5ml90. and 
H55I/6100 using plaque assays as described in Example 1. 
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i) Analysis Of Genomic DNA And Cellular RNA 

Cell lines that had received the preterminal protein expression plasmid were screen for 
the presence of pRSV-pTP sequences and El sequences. Total DNA was isolated from the 
LP-293, 8-6 (transfected with pRSV-pol only) or C-7 (cotransfected with pRSV-pol and 
pRSV-pTP) cell lines, two micrograms (^g) of each DNA was codigested with the restriction 
enzymes Xbal and Ba/wHI. electrophoretically separated in a 0.6% agarose gel. and 
transferred onto a nylon membrane. The membrane was UV crosslinked, probed with both a 
1.8 kb Blnl'Xhal fragment (spans the EI coding region) isolated from the plasmid pFGMO, 
and a l.Skb EcoRV aubixdigmtxiX of Ad serotype 5 (spans the preterminal protein coding 
sequences; see Fig. 1 B), both of which were random-primer radiolabeled with ^-P to a specific 
activity greater than 3.0 x 10^ cpm/jig. The membrane was subsequently exposed to X-ray 
film with enhancement by a fluorescent screen. The resulting autoradiograph is shown in 
Figure 5. 

In Figure 5. the preterminal specific sequences migrated as an -1 1.0 kb DNA fragment 
15 while the El containing band migrated as a 2.3 kb DNA fragment. No hybridization of either 
probe to DNA isolated from either the LP-293 or B-6 cell lines was observed As shown in 
Figure 5. only the C-7 cell line genomic DNA had preterminal coding sequences, unlike the 
parental LP-293 cells, or the Ad-polymerase expressing B-6 cells. In addition, all cell lines 
had El specific sequences present at nearly equivalent amounts, demonstrating that the 
20 selection design has not caused the loss of the El sequences originally present in the LP-293 
cells. The results presented in Example 1 demonstrated that both the B-6 and C-7 cell lines 
contain polymerase specific sequences within their genomes, unlike the parental LP-293 cells. 

To confirm that transcription of preterminal protein was occurring, total RNA was 
isolated from each of the cell lines, transferred to nylon membranes, and probed to detect 
25 preterminal protein-specific mRNA transcripts as follows. Total cellular RNA was isolated 
from the respective cell lines and 15 ^g of total RNA from each cell line was transferred to 
nylon membranes. The membranes were probed with the 1.8 kb £coRV radiolabeled 
subfragment of Ad5 (see Fig. IB) complementary to the preterminal protein coding region. 
The resulting autoradiograph is shown in Figure 6. 

As shown in Figure 6. a single mRNA of the expected size (--3 kb in length) is detected 
only in RNA derived from the C-7 cell line. No hybridization was detected in lanes 
containing RNA derived from the LP-293 or B-6 cell lines. In Example 1. it was 
demonstrated that the C-7 cell line also expresses high levels of the Ad polymerase mRNA. 
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Thus, the C-7 cell line constitutively expresses both the Ad polymerase and preterminal 
protein mRNAs along with El transcripts. 

ii) Plaquing Efficiency Of pTP Mutants On pTP- 
Expressing Cell Lines 

The C-7 cell line was screened for the ability to transcomplement the growth of 
preterminal mutant viruses. The virus H5wl90 (contains a 12 base pair insertion located 
within the carboxy-terminus of the preterminal protein) has been shown to have a severe 
growth and replication defect, producing less than 10 plaque-forming units per cell [Freimuth 
and Ginsberg (1986) Proc. Natl. Acad. Sci. USA 83:7816]. The results are summarized in 
Table 4 below. For the results shown in Table 4, LP-293, B-6. or C-7 cells were seeded at a 
density of 2.0-2.5 x 10^ cells per plate. The cells were infected with limiting dilutions of 
Ivsates derived from the preterminal protein-mutant viruses H5i«190 or HSsubm. incubated 
at 38.5°C. and plaques counted after six days. As shown in Table 4, only the C-7 cell line 
could allow efficient plaque formation of H5//il90 at 3fi,5^C (the H5i>il90 lysate used to 
infect the cells was of a low titer, relative to the high titer HSsuhlOO stock), while both the B- 
6 and C-1 cell lines had nearly equivalent plaquing efficiencies when HSsubm was utilized 
as the infecting virus. 

TABLE 4 

Plaquing Efficiency Of Preterminal Protein- Mutant Viruses 

Plaque Titres (pfu/ml) 



I 



Virus 



H5/WI90 



H5.vm/>I00 



Mutation 
Location 

carboxv-terminus 



amino-termmus 



LP-293 

<\ X 10- 



<l X 10' 



B-6 
<l X 10- 



9.0 X 10* 



C.7 

1.4 X 10' 



4.5 X 10* 



As shown in Table 4, when equivalent dilutions of H5/nl90 were utilized, the plaquing 
efficiency of the C-7 cell line was at least 100-fold greater than that of the B-6 or LP-293 
cells. This result demonstrated that the €-7 cell line produces a functional preterminal 
protein, capable of trans-complementing the defect of the H5ml90 derived preterminal 
protein. 

The cell lines were next screened for the ability to trans-complement with the 
temperature-sensitive virus. H5i«M00. at nonpermissive temperatures. HSiwMOO has a codon 
insertion mutation within the amino-terminus of the preterminal protein, as well as an El 
deletion. The mutation is responsible both for a temperature sensitive growth defect, as well 
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as a replication defect [Freimuth and Ginsberg (1986). supra and Schaack ei ai (1995) J. 
Virol. 69:4079]. The plaquing efficiency of the cell line C-? was found to be at least 1000 
fold greater than that of the LP-293 cells (at non-permissive temperatures) (see Table 4). 
Interestingly, the cell line B-6 was also capable of producing large numbers of WSsubm 
derived plaques at 38.5^C, even though it does not express any preterminal protein. This 
result suggested that the high level expression of the polymerase protein was allowing plaqi 
formation of WSsubm in the 8-6 cell line. To examine this possibility, the nature of 
\\5sub 1 00 growth in the various cell lines was examined. 



c) Complementation Of The Replication And Growth Defects 
Of H55II6100 

The cell lines B-6 and C-7 were shown to overcome the replication defect of H5ts36 
(Ex. 1 ). Since the preterminal and polymerase proteins are known to physically interact with 
each other [Zhao and Padmanabhan (1988) Cell 55:1005]. we investigated whether the 
expression of the Ad-polymerase could overcome the replication defect of H5^m6100 at non- 
permissive temperatures using the following replication-complementation assay. 

LP-293. B-6. or C-7 cells were seeded onto 60 mM dishes at a density of 2x10* cells 
per dish and infected the next day with HSsubXQQ at a multiplicity of infection (MOI) of 0.25, 
and incubated at 38.5°C for 16 hours or 32.0°C for 40 hours. The cells from each infected 
plate were then harvested and total DNA extracted as described in Example 1 . Four 
micrograms of each DNA sample was digested with //mdlll. electrophoresed through a 0.7% 
agarose gel. transferred to a nylon membrane, and probed with '-P-labeled H5ts36 virion 
DNA. The resulting autoradiograph is shown in Figure 7. As seen in Figure 7. the 
H55«6100 replication defect when grown in LP-293 cells at 38.5°C is seen; this defect is not 
present when the virus is grown at the same temperature in either B-6 or C-7 cells. 

The results depicted in Figure 7 demonstrates that both cell lines B-6 and C-7 could 
trans-complement the replication defect of WSsubXOQ. This result demonstrated that the 
expression of the Ad polymerase in B-6 cells was able to overcome the preterminal protein- 
mediated replication defect of H5iw6100. While not limiting the present invention to any 
particular mechanism, the ability of Ad polymerase to overcome the preterminal protein- 
mediated replication defect of H5iw6100 may be due to a direct physical interaction of the 
polymerase with the amino-terminus of the HS^wftlOO-derived preterminal protein. 
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m support of this hypothesis, it has also been demonstrated that the H:>subm 
replication defect can be overcome when LP-293 cells were infected with a lOO-fold greater 
amoum of HSsubm. However, complementation of the HSsublOQ replication defect is not 
sufficiem to overcome the growth defect of H55h6100. since temperature shift-up experiments 
have demonstrated that the HSsubm growth defect is not dependent upon viral replication 
[Schaack et al. (1995). supra]. Therefore, the overexpression of the Ad-polymerase must 
have allowed a very low level but detectable production of infectious HSsuhlOO particles in 
the B-6 cell line. The reduced growth of H5i«M00 is therefore not due to a replication 
defect, but rather some other critical activity that the preterminal protein has a role in. such as 
augmention of viral transcription by association with the nuclear matrix [Schaack and Shenk 
(1989) Curr. Top. Microbiol. Immunol. 144:185 and Hauser and Chamberlain (1996) J. Endo. 
149:373]. This was confirmed by assessing the ability of the C-7 cell line to overcome the 
growth defect of HSsublOO utilizing one-step growth assays performed as follows. 

Each of the cell lines (LP-293. B-6 and C-7) were seeded onto 60 mm dishes at 2.0 x 
10* cells/dish. The cell lines were infected at an MOI of 4 with each of the appropriate 
viruses (xvtAdS. H5ts36, or H5i«M00), and incubated at 38.50C for 40 hours. The total 
amoum of infectious virions produced in each 60 mm dish was released from the cell lysates 
bv three cvcles of freeze-thawing, and the titer was then determined by limiting dilution and 
plaque assay on B-6 cells at 38.5»C. The results are summarized in Figure 8. 

The results shown in Figure 8, demonstrated that even though the B-6 cell line allowed 
normal replication and plaque formation of H5.mM00 at 38.5°C (in fact. B-6 cells were 
utilized to determine the plaque litres depicted in Figure 8) they could not allow high level 
growth of H5.«M00 and only produced titres of HSsublOQ equivalent to that produced by the 
LP-293 cells. The C-7 cell line produced 100 fold more virus than the LP-293 or B-6 cells, 
see Figure 8. Encouragingly, the titre of HSsublOO produced by the C-7 cells approached 
titres produced by LP-293 cells infected with wild-type virus. Ad5. When the H5^«M00 
virions produced from infection of the C-7 cells were used to infect LP-293 cells at 38.5''C, 
all virus produced retained the ts mutation (/.... at least a 1000 fold drop in pfu was detected 
when LP-293 cells were respectively infected at 38.5''C vs. 32.0''C). This finding effectively 
) rules out the theoretical possibility that the HSsubm input virus genomes recombined with 
the preterminal protein sequences present in the C-7 cells. 

In addition, the C-7 cell line allowed the high level growth of H5ts36. demonstrating 
that adequate amounts of the Ad-polymerase protein were also being expressed. The C-7 cell 
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line was capable of trans-complementing the growth of both H5ts36 and HSsublOO after 4 
months of serial passaging, demonstrating that the coexpression of the EI. preterminal, and 
polymerase proteins was not toxic. 

These results demonstrate that the constitutive expression of both the polymerase and 
preterminal proteins is not detrimental to normal virus production, which might have occurred 
if one or both of the proteins had to be expressed only during a narrow time period during the 
Ad life cycle. In summary, these results demonstrated that the C-7 cell line can be used as a 
packaging cell line to allow the high level growth of El, preterminal, and polymerase deleted 
Ad vectors. 



EXAMPLE 3 

Production Of Adenovirus Vectors 
Deleted For El And Polymerase Functions 



In order to produce an Ad vectors deleted for EI and polymerase functions, a small, 
frame-shifting deletion was introduced into the Ad-pol gene contained within an El-deleted 
Ad genome. The plasmid pBHGl 1 (Microbix) was used as the source of an El -deleted Ad 
genome. pBHGIl contains a deletion of Ad5 sequences from bp 188 to bp 1339 (0.5-3.7 
m.u.): this deletion removes the packaging signals as well as EI sequences. pBHGl I also 
contains a large deletion within the E3 region (bp 27865 to bp 30995; 77.5-86.2 m.u.). The 
nucleotide sequence of pBHGI 1, is listed in SEQ ID N0:4 [for cross-corrleation between the 
pBHGI I sequence and the Ad5 genome (SEQ ID N0:1), it is noted that nucleotide 8,773 in 
pBHGI 1 is equivalent to nucleotide 7.269 in Ad5]. 

pBHG 1 1 was chosen to provide the Ad backbone because this plasmid contains a large 
deletion within the E3 region (77.5 to 86.2 m.u.) and therefore vectors derived from this 
plasmid permit the insertion of large pieces of foreign DNA. A large cloning capacity is 
important when the pol" vectors is to be used to transfer a large gene such as the dystrophin 
gene (cDNA = 13.6 kb). However, the majority of genes are not this large and therefore 
other Ad backbones containing smaller deletions within the E3 region (e.g.. pBHGlO which 
contains a deletion between 78.3 to 85.8 m.u.; Microbix) may be employed for the 
construction of pol" vectors using the su-ategy outlined below. 
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a) Construction Of A Plasmid Containing A Portion Of The 
Adenovirus Genome Containing The Polymerase Gene 

A fragment of the Ad genome containing the pol gene located on pBHGl 1 was 
subcloned to create pBSA-XB. Due to the large size of the Ad genome, this intermediate 
plasmid was constructed to facilitate the introduction of a deletion within the pol gene the pol 
deletion. pBSA-XB was constructed as follows. The polylinker region of pBluescript 
(Stratagene) was modified to include additional restriction enzyme recognition sites (the 
sequence of the modified polylinker is provided in SEQ ID N0:5; the remainder of 
pBluescript was not altered); the resulting plasmid was termed pBSX. pBHGl 1 was digested 
with Xbal and BamHl and the 20.223 kb fragment containing the pol and pTP coding regions 
(E2b region) was inserted into pBSX digested with Xbal and BamHl to generate pBSA-XB. 

b) Construction Of pBHG 1 1 Apol 

A deletion was introduced into the pol coding region contained within pBSA-XB in 
such a manner that other key viral elements were not disturbed (e.g.. the major late promoter, 
the tripartite leader sequences, the pTP gene and other leader sequences critical for normal 
virus viability). The deletion of the pol sequences was carried out as follows. pBSA-XB was 
digested with BspEl and the ends were filled in using T4 DNA polymerase. The B^pEI- 
digested. T4 polymerase filled DNA was then digested with BamHl and the 8809 bp 
fiamHI/fiipEI(filled) fragment ^yas isolated as follows. The treated DNA was run on a 0.6% 
agarose gel (TAE buffer) and the 8809 bp fragment was excised from the gel and purified 
using a QIAEX Gel Extraction Kit according to the manufacturer' s instructions (OIAGEN, 
Chatsworth, CA). 

A second aliquot of pBSA-XB DNA was digested with BspHl and the ends were filled 
in with T4 DNA polymerase. The B^pHI-digested. T4 polymerase filled DNA was then 
digested with BamHl and the 13.679 bp BamHllBspHimtd) fragmem was isolated as 
described above. 

The purified 8809 bp BamHI/BipEKfilled) fragmem and the purified 13,679 bp 
BamHl/BspHUmtd) fragmem were ligated to generate pApol. pApol contains a 612 bp 
deletion within the pol gene (bp 8772 to 9385; numbering relative to that of pBHGll) and 
lacks the 1 1.4 kb BamHl fragment containing the right arm of the Ad genome found within 
• pBHGll. 
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To provide the right arm of the Ad genome, pApol was digested with BamHl followed 

by treatment with calf intestinal alkaline phosphatase. pBHGl 1 was digested with BamHl and 
the 1 1 .4 kb fragment was isolated and purified using a QIAEX Gel Extraction Kit as 
described above. The purified 1 1.4 kb BamHl fragment was ligated to the 
flfl/wHI/phosphatased pApol to generate pBHGl lApol. Proper construction of pBHGll Apol 
was confirmed restriction digestion (//mdlll). 



c) Rescue And Propagation Of AdSApol Virus 

The Ad genome contained within pBHGllApol lacks the packaging signals. 
Therefore, in order to recover virus containing the 612 bp deletion within the pol gene from 
pBHGl 1 ApoL this plasmid must be cotransfected into packaging cells along with DNA that 
provides a source of the Ad packaging signals. The Ad packaging signals may be provided 
by wild-type or mutant Ad viral DNA or alternatively may be provided using a shuttle vector 
which contains the left-end Ad5 sequences including the packaging signals such as 
pAElspl A, pAElsplB (Microbix) or pAdBglll (pAdBglll is a standard shuttle vector which 
contains 0-1 m.u. and 9-16 m.u. of the adenovirus genome). 

To rescue virus, pBHGl lApol was co-transfected with AdSdllOOl viral DNA . 
Ad5 J/7001 contains a deletion in the E3 region; the E3 deletion contained within AdSdllOOl 
is smaller than the deletion contained within the Ad genome contained within pBHGl 1. It is 
not necessary that AdSdllOOl be used to recover virus; other adenoviruses, including wild- 
type adenoviruses, may be used to rescue of virus from pBHGl 1 Apol. 

It has been reported that the generation of recombinant Ads is more efficient if Ad 
DNA-terminal protein complex (TPC) is employed in conjunction with a plasmid containing 
the desired deletion [Miyake et ai (1996) Proc. Natl. Acad. Sci. USA 93:1320]. Accordingly, 
Ad5dl7001-TPC were prepared as described [Miyake et ai (1996). supra]. Briefly, purified 
AdSdllOOl virions (purified through an isopycnic CsCl gradient centered at 1.34 g/ml) were 
lysed by the addition of an equal volume of 8 M guanidine hydrochloride. The released 
AdSdllOOX DNA-TPC was then purified through a buoyant density gradient of 2.8 M CsCI/4 
M guanidine hydrochloride by centrifugation for 16 hr at 55,000 rpm in a VTi65 rotor 
(Beckman). Gradient fractions containing Ad5rf/7001 DNA-TPC were identified using an 
ethidium bromide spot test and then pooled, dialyzed extensively against TE buffer. BSA was 
then added to a final concentration of 0.5 mg/ml and aliquots were stored at -SO^'C. The 
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Ad5t//700l DNA-TPC was then digested with Seal and then gel-fihered through a Sephadex 
O-50 spin column. 

One hundred nanograms of the digested AdSdllOOl DNA-TPC was mixed vvith 5 ^g 
of pBHGl lApol and used to transfect pol-expressing 293 cells (i.e.. C-7). Approximately 10 
days post-transfection. plaques were picked. The recombinant viruses were plaque purified 
and propagated using standard techniques (Graham and Prevac. supra). Viral DNA was 
isolated, digested with restriction enzymes and subjected to Southern blotting analysis to 
determine the organization of the recovered viruses. Two forms of virus containing the 612 
bp pol deletion were recovered and termed Ad5ApolAE3I and Ad5ApolAE3Il. One form of 
recombinant pol" virus recovered, Ad5ApolAE3I. underwent a double recombination event 
with the Ad5t//7001 sequences and contains the E3 deletion contained within AdSdimX at 
the right end of the genome. The second form of recombinant pol" virus, Ad5ApolAE3II. 
retained pBHGl 1 sequences at the right end of the genome contained the E3 deletion 
found within pBHGl 1). These results demonstrate the production of a recombinant Ad vector 
containing a deletion within the pol gene. 

d) Characterization Of The E1+, pol- Viruses 

To demonstrate that the deletion contained within these two pol" viruses renders the 
virus incapable of producing functional polymerase. Ad5ApolAE3I was used to infect 293 
cells and pol-expressing 293 cells (B-6 and C-7 cell lines) and a viral replication- 
complementation assay was performed as described in Example le. Briefly, 293, B-6 and C-7 
cells were seeded onto 60 mm dishes at a density of 2 x 106 cells/dish and infected with 
HSsubm or Ad5ApolAE3I at an MOI of 0.25. The infected cells were then incubated at 
37°C or 38.5'»C for 16 hours, or at 32.0°C for 40 hours. Cells from each infected plate were 
then harvested and total DNA was extracted. Four micrograms of each DNA sample was 
digested with Hindlll electrophoresised through an agarose gel. transferred to a nylon 
membrane and probed with ^=P-labeled adenoviral DNA. The resulting autograph is shown in 
Figure 9. In Figure 9, each panel shows, from left to right, DNA extracted from 293, B-6 
and C-7 cells, respectively infected with either H5jm6100 or Ad5ApolAE3I (labeled 

AdSAPOL in Fig. 9). 

As shown in Figure 9. the recombinant pol" virus was found to be viable on pol- 
expressing 293 cells but not on 293 cells. These results demonstrates that recombinant Ad 
viruses containing the 612 bp deletion found within pApol lack the ability to express Ad 
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polymerase. These results also demonstrate that B-6 and C-7 cells efficiently complement the 
Apol found within Ad5ApolAE3I and Ad5ApolAE3II. In addition, these results show that 
replication of the ts pTP mutant HSsubm can be complemented by high level expression of 
the Ad polymerase with or without co-expression of pTP. 

Because the expression of early genes is required for the expression of the late gene 
products, the ability of recombinant viruses contaning the Apol deletion to direct the 
expression of late gene products was examined. 293 and C-7 cells were infected with 
Ad5ApolAE3I and 24 hours after infection cell extracts were prepared. The cell extracts were 
serially diluted and examined for the expression of the fiber protein (a late gene product) by 
immunoblot analysis. The immunoblot was performed as described in Example 3C with the 
exception that the primary antibody used was an anti-fiber antibody (FIBER-KNOB obtained 
from Robert Gerard University of Texas Southwestern Medical School). The results of this 
immunoblotting analysis revealed that no fiber protein was detected from 293 cells infected 
with Ad5ApolAE3I (at any dilutuion of the cell extract). In contrast, even a 1:1000 dilution 
of cell extract prepared from C-7 cells infected with Ad5ApolAE3I produced a visible band 
on the immunoblot. Therefore, the pol deletion contained within Ad5ApolAE3I resulted in a 
greater than 1000-fold decrease in fiber production. 

The above results demonstrate that polymerase gene sequences can be deleted from the 
virus and that the resulting deleted virus will only grow on cells producing Ad-polymerase in 
trans. Using the pol-expressing cell lines described herein (t\g.. B-6 and C-7). large 
quantities of the pol' viruses can be prepared. A dramatic shut-down in growth and late gene 
expression is seen when cells which do not express Ad polymerase are infected with the pol' 
viruses. 



e) Generation Of El", Pol" Ad Vectors 

Ad5c//7001 used above to recuse virus containing the polymerase deletion is an El- 
containing virus. The presence of El sequences on the recombinant pol" viruses is 
undesirable when the recombinant virus is to be used to transfer genes into the tissues of 
animals: the El region encodes the transforming genes and such viruses replicate extremely 
well in vivo leading to an immune response directed against cells infected with the El - 
containing virus. 

E 1 " viruses containing tiie above-described polymerase deletion are generated as 
follows. pBHGl 1 Apol is cotransfected into pol-expressing 293 cells (e.g.. B-6 or C-7) along 
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with a shuttle vector containing the left-end Ad5 sequences including the packaging signals. 
Suitable shunle vectors include pAElsplA (Microbix). pAElsplB (Microbix) or pAdBglll. 
The "ene of interest is inserted into the polylinker region of the shuttle vector and this 
plasmid is then cotransfected into B-6 or C-7 cells along with pBHGU Apol to generate a 
recombinant ET. pol" Ad vector containing the gene of interest. 

EXAMPLE 4 
Production Of Adenovirus Vectors 
Deleted For El And Preterminal Protein Functions 

In order to produce an Ad vectors deleted for El and preterminal protein functions, a 
small deletion was introduced into the Ad preterminal protein (pTP) gene contained within an 
El-deleted Ad genome. The plasmid pBHGll (Microbix) was used as the source of an El - 
deleted Ad genome to maximize the cloning capacity of the resulting pTP" vector. However, 
other Ad backbones containing smaller deletions within the E3 region (e.g., pBHGlO which 
contains a deletion between 78.3 to 85.8 m.u.; Microbix) may be employed for the 
construction of pTP" vectors using the strategy outlined below. 

a) Construction Of pApTP 

A deletion was introduced into the pTP coding region contained within pBSA-XB (Ex. 
3) in such a manner that other key viral elements were not disturbed {e.g.. the tripartite leader 
sequences, the i-leader sequences, the VA-RNA I and II genes, the 55 kD gene and the pol 
gene). The deletion of the pTP sequences was carried out as follows. pBSA-XB was 
digested with Xbal and EcoRV and the 7.875 kb fragment was isolated as described (Ex. 3). 
Another aliquot of pBSA-XB was digested with Muni and the ends were filled in using T4 
DNA polymerase. The A/««I-digested, T4 polymerase filled DNA was then digested with 
Xbal and the 14.894 kb AZ>oI/MM«I(filled) fragment was isolated as described (Ex. 3). The 
7.875 kb Muni fragmem and the 14.894 kb XballMunim^d) fragment were ligated together 
to generate pApTP. 
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b) Construction Of pBHG 1 1 ApTP 

pApol contains a 429 bp deletion within the pTP gene (bp 10.705 to 1 1.134; 
numbering relative to that of pBHGll) and lacks the 11.4 kb BamHl fragment containing the 
right arm of the Ad genome found within pBHG 1 1 . 

To provide the right arm of the Ad genome. pApTP was digested with BamHl 
followed by treatment with shrimp alkaline phosphatase (SAP: U.S. Biochemicals. Cleveland, 
OH). pBHGl 1 was digested with BamHl and the 11.4 kb fragment was isolated and purified 
using a QIAEX Gel Extraction Kit as described (Ex. 3). The purified 1 1.4 kb BamHl 
fragment was ligated to the 5awHI/phosphatased pApTP to generate pBHGl 1 ApTP. Proper 
construction of pBHGllApTP was confirmed restriction digestion. 

c) Rescue And Propagation Of Ad Vectors Containing The pTP 
Deletion 

The Ad genome contained within pBHGllApTP lacks the Ad packaging signals. 
Therefore, in order to recover virus containing the 612 bp deletion within the pol gene from 
pBHGl lApol this plasmid must be cotransfected into packaging cells along with DNA that 
provides a source of the Ad packaging signals. The Ad packaging signals may be provided 
by wild-type or mutant Ad viral DNA or alternatively may be provided using a shuttle vector 
which contains the left-end Ad5 sequences including the packaging signals such as 
pAElsplA. pAElsplB (Microbix) or pAdBglll. 

Recombinant Ad vectors containing the pTP deletion which contain a deletion within 
the E3 region are generated by cotransfection of pBHGl 1 ApTP (the gene of interest is 
inserted into the unique Pad site of pBHGl 1 ApTP) with a E3-deleted Ad virus such as 
Ad5dl700 into pTP-expressing 293 cells (e.g,, C-7); viral DNA-TPC are utilized as described 
above in Example 3. 

Recombinant vectors containing the pTP deletion which also contain deletions within 
the El and E3 regions are generated by cotransfection of pBHGllApTP into pTP-expressing 
293 cells (e.g.. C-7) along with a shuttle vector containing the left-end Ad5 sequences 
including the packaging signals. Suitable shuttle vectors include pAElsplA (Microbix), 
pAElsplB (Microbix) or pAdBglll. The gene of interest is inserted into the poly linker region 
of the shuttle vector and this plasmid is then cotransfected into B-6 or C-7 cells along with 
pBHGl lApTP to generate a recombinant El", pTP" Ad vector containing the gene of 
interest. 
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EXAMPLE 5 

Production Of Adenovirus Vectors Deleted For 
EL Polymerase And Preterminal Protein Functions 



In order to produce an Ad vectors deleted for El, polymerase and preterminal protein 
functions, a deletion encompassing pol and pTP gene sequences was introduced into the Ad 
sequences contained within an El -deleted Ad genome. The plasmid pBHGl 1AE4 was used 
as the source of an El -deleted Ad genome to maximize the cloning capacity of the resulting 
pop, pTP" vector. pBHGllAE4 is a modified form of BHGll which contains a deletion of 
all E4 genes except for the E4 ORF 6; the E4 region was deleted to create more room for the 
insertion of a gene of interest and to further disable tlie virus. However, other El" Ad 
backbones, such as pBHGl 1 and pBHGl (Microbix; pBHGlO contains a smaller deletion 
within the E3 region as compared to pBHGll), may be employed for the construction of 
pol". pTP" vectors using the strategy outlined below. 

Due to the complexity of the cloning steps required to introduce a 2.3 kb deletion that 
removes portions of both the pTP and pol genes, this deletion was generated using several 
steps as detailed below. 

a) Construction Of pAXBApolApTPVARNA+tl3 

In order to create a plasmid containing Ad sequences that have a deletion within the 
pol and pTP genes, pAXBApolApTPVARNA+tl3 was constructed as follows. 

pBSA-XB was digested with BspEl and the 1 8 kb fragment was isolated and 
recircularized to create pAXBApolApTP; this plasmid contains a deletion of the sequences 
contained between the BspEl sites located at 8J73 and 12,513 (numbering relative to 
5 pBHGll). 

A fragment encoding the VA-RNA3 sequence and the third leader of the tri-partite 
leader sequence was prepared using the PGR as follows. The PGR was carried out in a 
solution containing H5ts36 virion DNA (any Ad DNA, including wild-type Ad, may be used), 
2 ng/mL of primers 4005E and 4006E. 10 mM Tris HCl, pH 8.3. 50 mM KCl. 1.5 mM 
0 MgCl.. 0.001% gelatin and Pfu polymerase. The forward primer. 4005E, [5'- 

TGCCGCAGCACCGGATGCATCo* (SEQ ID N0:6)] contains sequences complementary to 
residues 12,551 to 12,571 of pBHGll (SEQ ID N0:4). The reverse primer. 4006E, [5'- 
GCGTCCGGAGGCTGCCATG CGGCAGGGo' (SEQ ID N0:7)] is complementary to 
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residues 1 1.091 to 1 1.108 of pBHGl 1 (SEQ ID N0:4) as well as a BspEl site (underlined). 
The predicted sequence of the -1.6 kb PGR product is listed in SEQ ID N0:8. 

PGR was performed with a Perkin Elmer 9600 Thermocycler utilizing the following 
cycling parameters: initial denaturation at 94*'G for 3 min. 3 cycles of denaturation at 94°G 
for 30 sec. annealing at SO'G for 30 sec. and extension at 72°G for 60 sec. followed by 
another 27 cycles with an increased annealing temperature at 56°C. with a final extension at 
72''G for 10 minutes. The -1.6 kb PGR product was purified using a QIAEX Gel Extraction 
Kit as described (Ex. 3). 

The purified PGR fragment was then digested with BspEl pAXBApolApTP was 
digested with BspEl followed by treatment with SAP. The fii^EI-digested PGR fragment and 
the 5*;7EI-SAP-treated pAXBApolApTP were ligated together to create 
pAXBApol ApTP V ARM A+t 1 3 . 



b) Construction Of pBHGllApolApTPVARNA+tl3 

pAXBApolApTPVARNA+tl3 contains a 2.3 kb deletion within the pol and pTP genes 
(bp 8.773 to 1 1,091; numbering relative to that of pBHGl 1) and lacks the 1 1.4 kb BamHl 
fragment containing the right arm of the Ad genome. 

To provide the right arm of the Ad genome, pAXBApolApTPVARNA+tl3 was 
digested with BamHl followed by treatment with SAP. pBHGl 1 AE4 (pBHGl 1 or pBHGlO 
may be used in place of pBHGl 1AE4) was digested with BamHl and the 1 1.4 kb fragment 
was isolated and purified using a QIAEX Gel Extraction Kit as described (Ex. 3). The 
purified II .4 kb BamHl fragment was ligated to the iSamHI/phosphatased 
pAXBApolApTPVARNA+tl3 to generate pBHGI lApolApTPVARNA+tl3. Proper 
construction of pBHGl lApolApTPVARNA+tI3 was confirmed resttiction dieestion. 

g) Rescue And Propagation Of Ad Vectors Containing The pol, 
pTP Double Deletion 

The Ad genome contained within pBHGllApolApTPVARNA+tl3 lacks the Ad 
packaging signals. Therefore, in order to recover virus containing the 2.3 kb deletion within 
the pol and pTP genes from pBHGllApolApTPVARNA+tl3. this plasmid must be 
cotransfected into packaging cells along with DNA that provides a source of the Ad 
packaging signals. The Ad packaging signals may be provided by wild-type or mutant Ad 
viral DNA or alternatively may be provided using a shuttle vector which contains the left-end 
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Ad5 sequences including the packaging signals such as pAEisplA. pAElsplB (Microbix) or 
pAdBgUI. 

Recombinant vAd vectors containing the pol, pTP double deletion which also contain a 
deletion within the E3 region are generated by cotransfection of 

pBHGllApolApTPVARNA+tl3 (the gene of interest is inserted into the unique Pad site of 
pBHGl lApolApTPVARNA+tl3) with a E3-deleted Ad virus such as Ad5(//7001 into pol- and 
pTP-expressing 293 cells (e.g.. C-7); viral DNA-TPC are utilized as described (Ex. 3). 

Recombinant vectors containing the pol. pTP double deletion which also contain 
deletions within the El and E3 regions are generated by cotransfection of 
pBHGl lApolApTPVARNA+tl3 into pol- and pTP-expressing 293 cells {e.g.. C-7) along with 
a shuttle vector containing the left-end Ad5 sequences including the packaging signals. 
Suitable shuttle vectors include pAElsplA (Microbix), pAElsplB (Microbix) or pAdBglll. 
The gene of interest is inserted into the polylinker region of the shuttle vector and this 
plasmid is then cotransfected into B-6 or C-7 cells along with 

pBHGllApolApTPVARNA+tl3 to generate a recombinant El", pol pTP" Ad vector 
containing the gene of interest. 



EXAMPLE 6 

Encapsidated Adenovirus Minichromosomes 
Containing A Full Length Dystrophin cDNA 



In this Example, the construction of an encapsidated adenovirus minichromosome 
(EAM) consisting of an infectious encapsidated linear genome containing Ad origins of 
replication, packaging signal elements, a p-galactosidase reporter gene cassette and a full 
length (14 kb) dystrophin cDNA regulated by a muscle specific enhancer/promoter is 
described. EAMs are generated by cotransfecting 293 cells with supercoiled plasmid DNA 
(pAdSpdys) containing an embedded inverted origin of replication (and the remaining above 
elements) together with linear DNA from El-deleted virions expressing human placental 
alkaline phosphatase (hpAP). All proteins necessary for the generation of EAMs are provided 
in trans from the hpAP virions and the two can be separated from each other on equilibrium 
CsCl gradients. These EAMs are useful for gene u-ansfer to a variety of cell types both in 
vitro and in vivo. 
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a) Generation And Propagation Of Encapsidated Adenovirus 
Minichromosomes 

To establish a vector system capable of delivering full length dystrophin cDNA clones, 
the minimal region of Ad5 needed for replication and packaging was combined with a 
conventional plasmid carrying both dystrophin and a p-galactosidase reporter gene. In the 
first \'ecior constructed, these elements were arranged such that the viral ITRs flanked (ITRs 
facing outward) the reporter gene and the dystrophin gene [U\. the vector contained from 5' 
to 3' the right or 3' ITR (mu 100 to 99), the dystrophin gene and the reporter gene and the 
left or 5" ITR and packaging sequnce (mu 1 to 0)]. Upon introduction of this vector along 
with helper virus into 293 cells, no encapsidated adenovirus minichromosomes were 
recovered. 

The second and successful vector, pAdSpdys (Fig. 10) contains 2.1 kb of adenovirus 
DNA. together with a 14 kb murine dystrophin cDNA under the control of the mouse muscle 
creatine kinase enhancer/promoter, as well as a P-galactosidase gene regulated by the human 
cytomegalovirus enhancer/promoter. 

Figure 10 shows the structure of pAdSpdys (27.8 kb). The two inverted adenovirus 
origins of replication are represented by a left and right inverted terminal repeat 
(LITR/RITR). Replication from these ITRs generates a linear genome whose termini 
correspond to the 0 and 1 00 map unit (mu) locations. Orientation of the origin with respect 
to wild type adenovirus serotype 5 sequences in mu is indicated above the figure (1 mu=360 
bp). Encapsidation of the mature linear genome is enabled by five (Al-AV) packaging signals 
(4^). The E, coli P-galactosidase and mus musculus dystrophin cDNAs are regulated by 
cytomegalovirus (CMV) and muscle creatine kinase (MCK) enhancer/promoter elements, 
respecti\ely. Both expression cassettes contain the SV40 polyadenylation (pA) signal. Since 
the El A enhancer/promoter overlaps with the packaging signals, pAdSpdys was engineered 
such that RNA polymerase transcribing from the EI A enhancer/promoter will encounter the 
SV40 late polyadenylation signal. Pertinent restriction sites used in constructing pAdSpdys 
are indicated below the figure. An adenovirus fragment corresponding to mu 6.97 to 7.77 
was introduced into pAdSpdys during the cloning procedure (described below). PI and P2 
represent location of probes used for Southern blot analysis. Restriction sites destroyed 
during the cloning of the Ad5 origin of replication and packaging signal are indicated in 
parentheses. 
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pAdSpdys was constructed as follows. pBSX (Ex. 3a) was used as the backbone for 
construction of pAdSpdys, The inverted fused Ad5 origin of replication and five 
encapsidation signals were excised as a Pstl/Xbal fragment from pAdSori. a plasmid 
containing the 6 kb HindUl fragment from pFGHO (Microbix Biosy stems). This strategy also 
introduced a 290 bp fragment from Ad5 corresponding to map units 6.97 to 7.77 adjacent to 
the right inverted repeat (see Fig. 10). The £ coli p-galactosidase gene regulated by the 
human CMV immediate early (IE) promoter/enhancer expression cassette was derived as an 
EcoRl/HindlU fragment from pCMVp [MacGregor and Caskey (1989) Nucleic Acids Res. 
17:2365]: the CMV IE enhancer/promoter is available from a number of suppliers as is the £ 
coli P-galactosidase gene]. The murine dystrophin expression cassette was derived as a 
BssHll fragment from pCVAA, and contains a 3.3 kb MCK promoter/enhancer element 
[Phelps ei ai (1995) Hum. Mol. Genet. 4:1251 and Jaynes et al (1986) Mol. Cell. Biol, 
6:2855]. The sequence of the -3.3 kb MCK promoter/enhancer element is provided in SEQ ID 
N0:9 (in SEQ ID N0:9, the last nucleotide of SEQ ID N0:9 corresponds to nucleotide +7 of 
the MCK gene). The enhancer element is contained within a 206 bp fragment located -1256 
and -1050 upstream of the transcription start site in the MCK gene. 

It was hoped that the pAd5pdys plasmid would be packageable into an encapsidated 
minichromosome when grown in parallel with an El -deleted virus due to the inclusion of both 
inverted terminal repeats (ITR) and the major Ad packaging signals in the plasmid. The ITRs 
and packaging signals were derived from pFGMO (Microbix), a plasmid that generates El- 
defective Ad particles upon transfection of human 293 cells. 

hpAP is an El deleted Ad5 containing the human placental alkaline phosphatase gene 
[Muller et ai (1994) Circ. Res. 75:1039]. This virus was chosen to provide the helper 
functions so that it would be possible to monitor the titer of the helper virus throughout serial 
passages by quantitative alkaline phosphatase assays. 

293 cells were cotransfected with pAdSpdys and hpAP DNA as follows. Low passage 
293 cells (Microbix Biosystems) were grown and passaged as suggested by the supplier. Five 
pAd5pdys and hpAP DNA (5 and 0.5 ^ig, respectively) were dissolved in 70 ^il of 20 mM 
HEPES buffer (pH 7.4) and incubated with 30 ^l of DOTAP (BMB) for 15 min. at room 
temperature. This ihixture was resuspended in 2 mis of DMEM supplemented with 2% fetal 
calf serum (FCS) and added dropwise to a 60 mm plate of 293 cells at 80% confluency. Four 
hours post-transfection the media was replaced by DMEM with 10% FCS. Cytopathic effect 
was obser\'ed 6-12 days post-transfection. 
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Cotransfection of 293 cells with supercoiled pAdSpdys and linear hpAP DNA 
produced AdSpdys EAMs (the encapsidated version of pAdSpdys) approximately 6 to 12 days 
post transfection. as evidenced by the appearance of a cytopathic effect (CPE). Initially, the 
amount of AdSpdys EAMs produced was significantly lower than that of hpAP virions so the 
viral suspension was used to re-infect fresh cultures, from which virus was isolated and used 
for serial infection of additional cultures (Fig. 1 1 ). Infection and serial passaging of 293 cells 
was carried out as follows. 

Lysate from one 60 mm plate of transfected 293 cells was prepared by vigorously 
washing the cells from the plate and centrifuging at 1 K rpm in a clinical centrifuge. Cells 
were resuspended in DMEM and 2% FCS. freeze-thawed in a dry ice-ethanol bath, cell debris 
removed by ceno-ifugation, and approximately 75% of the crude lysate was used to infect 293 
cells in DMEM supplemented with 2% FCS for 1 hour and then supplemented with 10% FCS 
thereafter. Infection was allowed to proceed for 18-20 hrs before harvesting the virus. The 
total number of cells infected in each serial passage is indicated in Figure 1 1 . 

In Figure 1 1, the total number of transducing adenovirus particles produced (output) 
per serial passage on 293 cells, total input virus of either the helper (hpAP) or AdSpdys, and 
the total number of cells used in each infection is presented. The total number of input/output 
transducing particles were determined by infection of 293 cells plated in 6-well microtiter 
plates. Twenty four hours post-infection the cells were assayed for alkaline phosphatase or p- 
galactosidase activity (described below) to determine the number of cells transduced with 
either AdSpdys or hpAP. The number of transducing particles were estimated by 
extrapolation of the mean calculated from 31 randomly chosen 2.5 mm- sectors of a 961 mm" 
plate. The intra-sector differences in total output of each type of virus are presented as the 
standard deviations, a. in Figure 11. For each serial passage. 75% of the total output virus 
from the previous passage was used for infection. 

Alkaline phosphatase or P-galactosidase activity was determined as follows. For 
detection of alkaline phosphatase, infected 293 cells on Petri dishes were rinsed twice with 
phosphate buffered saline (PBS) and fixed for 10 minutes in 0.5% glutaraldehyde in PBS. 
Cells were again rinsed twice with PBS for ten minutes followed by inactivation of 
endogenous alkaline phosphatase activity at 65''C for I lir. in PBS prior to the addition of the 
chromogenic substrate BCIP (5-bromo-4-chloro-3-indolyl phosphate) at 0. 1 5 mg/ml and nitro 
blue tetrazolium at 0.3 mg/ml) . Cells were incubated at 37''C in darkness for 3- 24 hrs. For 
P-galactosidase assays, the cells were fixed and washed as above, then assayed using standard 
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methods [MacGregor et a/. (1991) In Munay (ed.). Methods in Molecular Biology: Vol. ' 
Gene Transfer and Expression Protocols, Human Press Inc.. Clifton. NJ. pp. 217-225]. 

As shown in Figure 1 1. the rate of increase in titer of AdSpdys EAMs between 
transfection and serial passage 6 was approximately 100 times greater than that for hpAP 
virions. This result indicated that AdSpdys has a replication advantage over the helper virus, 
probably due to the shorter genome length (a difference of approximately 8 kb) and hence an 

increased rate of packaging. 

Interestingly, after serial passage 6 there was a rapid decrease in the total titer of hpAP 
virions whereas the titer of AdSpdys EAMs continued to rise. While not limiting the present 
invention to any particular mechanism, at least two possible mechanisms could be responsible 
for this observation. Firstly, a buildup of defective hpAP virions due to infections at high 
multiplicities may slowly out-compete their full length counterparts, a phenomenon that has 
been previously observed upon serial propagation of adenovirus [Daniell (1976) J. Virol. 
19:685: Rosenwirth et al. (1974) Virology 60:431; and Burlingham et al. (1974) Virology 
60:419]. Secondly, the emergence of replication competent virions due to recombination 
events between El sequences in cellular DNA and the hpAP genome could lead to a buildup 
of virus particles defective in expressing alkahne phosphatase [Lochmiiller et al. (1994) Hum. 
Gene Ther. 5:1485]. 

Southern analysis of DNA prepared from serial lysates 3, 6. 9 and 12 indicated that 
full length dystrophin sequences were present in each of these lysates (Fig. 12A). In addition, 
the correct size restriction fragments were detected using both dystrophin and p-galactosidase 
probes against lysate DNA digested with several enzymes (Figs. 12A-B). 

FiEure 12 shows a Southern blot analysis of viral DNA from lysates 3. 6. 9 and 12. 
digested with the restriction enzymes BssHlL Nru\ and EcoRV, indicating the presence of a 
full length dystrophin cDNA in all lysates. Fragments from the C terminus of mm musculus 
dystrophin cDNA (A) or the N terminus of E coli p-galactosidase (B) were labeled with 
dCTP^- and used as probes [Sambrook et al, supra]. The position of these probes and the 
predicted fragments for each digest is indicated in Figure 10. Note that one end of each 
fragment (except the 17.8 kb BssHil dystrophin fragment) detected is derived from the end of 
the linearized AdSpdys genome (see Fig. 10). Low levels of shorter products, presumably 
derived from defective virions, become detectable only at high serial passage number. 

At the later passages (9 and 12) there appeared to be an emergence of truncated 
AdSpdys sequences, suggesting that deletions and/or rearrangements may be occurring at later 
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passages. Hence, most experiments were performed with AdSpdys EAMs derived from the 
earlier passages. The possibility of the emergence of replication competent (El -containing) 
viruses was also examined by infection of HeLa cells by purified and crude serial lysates. 
none of which produced any detectable CPE. 

b) Purification Of Encapsidated Adenovirus Minichromosomes 
On CsCl Gradients 

The ability to separate AdSpdys EAMs from hpAP virions based on their buoyancy 
difference (due presumably to their different genome lengths) on CsCl gradients was 
examined. Repeated fractionation of the viral lysate allows small differences in buoyancy to 
be resolved. CsCl purification of encapsidated adenovirus minichromosomes was performed 
as follows. Approximately 25% of the lysate prepared from various passages during serial 
infections was used to purify virions. Freeze-thawed lysate was centrifuged to remove the 
cell debris. The cleared lysate was extracted twice with 1,1.2 tricholorotrifluoroeihane 
(Sigma) and applied to CsCl step and self forming gradients. 

Purification of virus was initially achieved by passing it twice through CsCl step 
gradients with densities of p=1.45 and p=1.20 in a SW28 rotor (Beckman). After isolation of 
the major band in the lower gradient, the virus was passed through a self forming gradient 
(initial p=1.334) at 37.000 rpm for 24 hrs followed by a relaxation of the gradient by 
reducing the speed to 10,000 rpm for 10 hrs. in a SW41 rotor (Beckman), at 12^C [Anet and 
Strayer (1969) Biochem. Biophys. Res. Commun. 34:328]. The upper band from the gradient 
(composed mainly of AdSpdys virions) was isolated using an 18 gauge needle, reloaded on a 
fourth CsCl gradient (p=1.334) and purified at 37,000 rpm for 24 hrs followed by 10.000 rpm 
for 10 hrs at 12°C, 

The AdSpdys-containing CsCl band was removed in 100 jil fractions from the top of 
the centrifugation tube and CsCl was removed by chromatography on Sephadex G-50 
(Pharmacia). Aliquots from each fraction were used to infect 293 cells followed by P- 
galactosidase and alkaline phosphatase assays to quantitate the level of contamination by 
hpAP virions in the final viral isolate. 

Results of the physical separation between AdSpdys EAMs and hpAP virions are 
shown in Figure 13. Figure 13 shows the physical separation of Ad5pdys from hpAP virions 
at the third (A) and final (B) stages of CsCI purification (initial p= 1.334) in a SW41 tube. 
Aliquots of AdSpdys EAMs firom the final stage were drawn through the top of the 
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centrifugation tube and assayed for p-galactosidase and alkaline phosphatase expression. 
Results of these assays are presented in Figure 14. 

In Figure 14. shows the level of contamination of AdSpdys EAMs by hpAP virions 
obtained from passage 6. Following four cycles of CsCl purification, aliquots were removed 
from the top of the centrifugation tube and used for infection of 293 cells, which were fixed 
24 hrs later and assayed for p-galactosidase or alkaline phosphatase activity (described below). 
The number of transducing particles are an underestimation of the actual totals as in some 
cases a positive cell may have been infected by more than one transducing particle. The ratio 
of the two types of virions - AdSpdys EAMs (LacZ) or hpAP (AP) in each fraction is 

indicated in the lower graph. 

The maximum ratio of transducing AdSpdys EAMs to hpAP virions reproducibly 
achieved in this study was 24.8 - a contamination with helper virus corresponding to 
approximately 4% of the final viral isolate. 

c) Dystrophin And /S-galactosidase Expression By £ncapsidated 
Adenovirus Minichromosomes In Muscle Cells 

To determine if the AdSpdys EAMs were able to express p-galactosidase and 
dystrophin in muscle cells, mouse mdx myogenic cultures were infected with CsCl purified 
EAMs. 



i) Propagation And Infection Of Muscle Cells 

MM 14 and mdx myogenic cell lines were kindly provided by S. Hauschka (University 
of Washington) and were cultured as previously described [Linkhart et a/. (1981) Dev. Biol. 
86:19 and Clegg et al. (1987) J. Cell Biol. 105:949]. Briefly, myoblasts were grown on 
_5 plastic tissue culture plates coated with a 0.1% gelatin in Ham's F-10 medium containing 15% 
(v/v) horse serum. 0.8 mM CaCU, 200 ng/ml recombinant human basic fibroblast growh 
factor (b-FGF) and 60 ng/ml genitimicin (proliferation medium). Cutures were induced to 
differentiate by switching to growth in the presence of growth medium lacking b-FGF and 
containing 10% horse serum (differentiation medium). Myoblasts or differentiated myotubes 
30 (three days post switching) were infected at a multiplicity of infection of 2.2 AdSpdys EAMs 
per cell. Fractions containing minimal contamination with hpAP virions (3. 4 and 5 of 
• passage 6) were used for western and immunofluorescence analysis. Infection was allowed to 
proceed for 3 days for both the myoblasts and myotubes before harvesting cells. 
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ii) Total Protein Extraction And Immunoblot 
Analysis 

For protein extraction, muscle cells were briefly trypsinized. transferred to a 
microcentrifuge tube, centrifuged at 14 K for 3 min at room temp and resuspended two times 
in PBS. After an additional centrifugation. the cell pellet was resuspended in 80 jal of RIPA 
buffer (50 mM Tris-CI, pH 7.5; 150 mM NaCl: 1% Nonidet P-40: 0.5% sodium 
deoxycholate; 0.1% SDS) (Sambrook et aL supra). The sample was briefly sheared using a 
22 gauge needle to reduce viscosity and total protein concentration assayed using the 
bicinconinic acid protein assay reagent (Pierce. Rockford, IL). Expression of full length 
10 dystrophin or P-galactosidase in infected mdx and MM 14 myoblasts or myotubes was 

analyzed by electrophoresis of 40 ng of total protein extract on a 6% SDS-PAGE gel (in 25 
mM Tris. 192 mM glycine. 10 mM P-mercaptoethanol. 0.1% SDS). After transferring to 
Gelman Biotrace NT membrane (in 25 mM Tris. 192 mM glycine. 10 mM P-mercaptoethanoL 
0.05% SDS. 20% methanol), the membrane was blocked with 5% non-fat milk and 1% goat- 
15 serum in Tris-buffered saline-Tween (TBS-T) for 12 hrs at 4°C. Immunostaining was done 
according to the protocol for the ECL western blotting detection reagents (Amersham Life 
Sciences. Buckingham. UK). The primary antibodies used were Dys-2 (Vector Laboratories) 
and anti-p.galactosidase (BMB. Indianapolis. IN) with a horseradish peroxidase-conjugated 
anti-mouse secondary antibody. 
20 Western blot analysis of EAM-infected mdx myoblasts and myotubes (three days post- 

fusion) indicated that EAMs were able to infect both of these cell types (Figure 15). In 
Figure 15. immunoblots of protein extracts from mdx myoblasts and myotubes demonstrating 
the expression of p-galactosidase (A) and dystrophin (B) in cells infected with Ad5pdys 
EAMs. Total protein was extracted 3 days post infection in all cases. Myotubes were 
25 infected at three days following a switch to differentiation media. In Figure 1 5A. lane 1 

contains total protein extract from 293 cells infected with a virus expressing p-galactosidase 
as a control (RSV-LacZ); lanes 2-5 contain total protein extracts from uninfected mdx 
myoblasts, mdx myoblasts infected with Ad5pdys EAMs, mdx myotubes and mdx myotubes 
derived from mdx myoblasts infected with Ad5pdys EAMs. respectively. In Figure 15B. 
30 lanes 1 and 7 contain total protein from mouse muscle ("C57") while lane 2 contains protein 
from wild type MM14 myotubes. as controls. Lanes 3-5 contain total protein extracts from 
uninfected mdx myoblasts, mdx myoblasts infected with Ad5pdys EAMs, mdx myotubes and 
mdx myotubes derived from mdx myoblasts infected with Ad5pdys EAMs. respectively. 



- 53 - 



PCT/US97/19541 

WO 98/17783 

AS shown in Figure 15. expression of p-galactosidase was detected in both the infected 
mdx myoblasts and myotubes indicating that the CMV promoter was active at both these early 
staees of differentiation in muscle cells. However, only infected mdx myotubes produced 
protein detected by dys-2. an antibody recognizing the 17 C-terminal amino acids of 
dystrophin (Fig. 15). No dystrophin expression was detected in infected myoblasts by western 
analysis, indicating that the muscle creatine kinase promoter functions minimally, if at all. 
within the AdSpdys EAM prior to terminal differentiation of these cells. 

Dystrophin expression in mdx cells infected with EAMs was confirmed by 
immunofluorescence studies using N-terminal dystrophin antibodies. In agreement with the 
western analysis, dystrophin expression from the MCK promoter was detected only in 
differentiated mdx myotubes infected by AdSpdys EAMs (Fig. 16). Figures 16A-C show 
immunofluorescence of dystrophin in wild type MM14 myotubes (A), uninfected mdx (B) and 
infected mdx myotubes (C), respectively. The results shown in Figure 16 demonstrate the 
transfer and expression of recombinant dystrophin to differentiated mdx cells by AdSpdys 
EAMs. 

Immunofluorescence of myogenic cells was performed as follows. Approximately 1.5 
X 10" MM14 or mdx myoblasts were plated on Poly-L-lysine (Sigma) coated glass slides (7 x 
3 cm) which had been previously etched witii a 0.05% chromium potassium sulfate and 0.1% 
gelatin solution. For myotube analysis, the cultures were switched to differentiation media 
[Clegg et al. (1987), supra] 48 hours after plating, immediately infected and then allowed to 
fuse for 3 days, whereas myoblasts were continuously propagated in proliferation media 
[Clegg et al. (1987). supra]. Cells were washed three times with PBS at room temperature 
and fixed in 3.7% formaldehyde. For immunostaining. cells were incubated in 0.5% Triton 
X-100. blocked with 1% normal goat serum and incubated with an affinity purified antibody 
) against the N-terminus of murine dystrophin for 2 hrs. followed by extensive washing in PBS 
and 0.1% Tween-20 with gentle shaking. Cells were incubated with a 1:200 dilution of biotin 
conjugated anti-rabbit antibody (Pierce) for one hour and washed as above. Cells were 
further incubated with a 1:300 dilution of streptavidin-fluorescein isotiiiocyanate conjugate 
(Vectorlabs. Burlingam. CA) for one hour and washed as above, followed by extensive 

0 washing in PBS. 

The above results show that embedded inverted Ad origins of replication coupled to an 
encapsidation signal can convert circular DNA molecules to linear forms in the presence of 
helper virus and that tiiese genomes can be efficientiy encapsidated and propagated to high 
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titers. Such viruses can be purified on a CsCl gradient and maintain their abilit>' to transduce 
cells in vitro, and their increased cloning capacity allows the inclusion of large genes and 
tissue specific gene regulatory elements. The above results also show that the dystrophin gene 
was expressed in cells transduced by such viruses and that the protein product was correctly 
localized to the cell membrane. The above method for preparing EAMs theoretically enables 
virtually any gene of interest to be inserted into an infectious minichromosome bv 
conventional cloning in plasmid vectors, followed by cotransfection with helper viral DNA in 
293 cells. This approach is useful for a variety of gene transfer studies in vitro. The 
observation that vectors completely lacking viral genes can be used to transfer a ftill-length 
dystrophin cDNA into myogenic cells indicates that this method may be used for the 
treatment of DMD using gene therapy. 

d) A Modified MCK Enhancer Increases Expression Of Linked 
Genes In Muscle 

The DNA fragment containing enhancer/promoter of the MCK gene utilized in the 
AdSpdys EAM plasmid is quite large (--3.3. kb). In order to provide a smaller DNA fragment 
capable of directing high levels of expression of linked genes in muscle cells, portions of the 
3.3 kb MCK enhancer/promoter were deleted and/or modified and inserted in front of a 
reporter gene (lacZ), The enhancer element of the MCK gene was modified to produce the 
2RS5 enhancer: the sequence of the 2RS5 enhancer is provided in SEQ ID NO: 10. The first 
6 residues of SEQ ID NO: 10 represent a Kpnl site added for ease of manipulation of the 
modified MCK enhancer element. Residue number 7 of SEQ ID NO: 10 corresponds to 
residue number 2164 of the wild-type MCK enhancer sequence listed in SEQ ID N0:9 
(position 2164 of SEQ ID N0:9 corresponds to position -1256 of the MCK gene). Residue 
number 174 of SEQ ID NO: 10 corresponds to residue number 2266 of the wild-type MCK 
enhancer sequence listed in SEQ ID N0:9. 

These MCK/lacZ constructs were used to transfect cells in culture {i.e.. myogenic 
cultures) or were injected as naked DNA into the muscle of mice and P-galactosidase activity 
was measured. Figure 1 7 provides a schematic of the MCK/lacZ constructs tested. The first 
construct shown in Figure 17 contains the 3.3. kb wild-type MCK enhancer/promoter 
fragment linked to the E, coli lacZ gene. The wild-type enhancer element (-1256 to -1056) 
is depicted by the box containing '*E": the core promoter element (-358 to -80) is indicated by 
the light cross-hatching and the minimal promoter element (-80 to +7) is indicated by the dark 
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cross-hatching. The core promoter element is required in addition to the minimal promoter 
element (which is required for basal expression) in order to achieve increased muscle-specific 
expression in tissue culture [Shield et al. (1996) Mol. Cell. Biol. 16:5058]. The modified 
enhancer element, the 2RS5 enhancer, is indicated by the box labeled "E*." The box labeled 
"mi nx" contains a synthetic intron derived from adenovirus and is used to increase expression 
of the constructs (any intron may be utilized for this purpose). The box labeled "CMV" 
depicts the CMV IE enhancer/promoter which was used a positive control. 

In figure 17, p-galactosidase activity is expressed relative to the activity of the wild- 
type MCK enhancer/promoter construct shown at the top of the figure. As shown in Figure 
17. a construct containing the 2RS5 enhancer (SEQ ID NO: 10) linked to either the minimal 
MCK promoter (a ~ 261 bp element) or the core and minimal MCK promoter elements (a -539 
bp element) directs higher levels of expression of the reporter gene in muscle cells as 
compared to the -3.3 kb fragmem containing the wild-type enhancer elemem. These modified 
enhancer/promoter elements are considerably smaller than the -3.3 kb fragment used in the 
AdSpdys EAM plasmid and are useful for directing the expression of foreign genes in muscle 
cells. These smaller elements are particularly useful for driving the expression of genes in the 
context of self-propagating adenoviral vectors which have more severe constraints on the 
amount of foreign DNA which can be inserted in comparison to the used of "gutted" 
adenoviruses such as the EAMs described above. 

EXAMPLE 7 

Generation Of High Titer Stocks Of Encapsidated Adenovirus 
Minichromosomes Containing Minimal Helper Virus Contamination 

•5 The results presented in Example 6 demonstrated that encapsidated adenovirus 

minichromosomes (EAMs) can be prepared that lack all viral genes and which can express 
full-length dystrophin cDNAs in a muscle specific manner. The propagation of these EAMs 
requires the presence of helper adenoviruses that contaminate the final EAM preparation with 
conventional adenoviruses (about 4% of the total preparation). In this example the EAM 

30 system is modified to enable the generation of high titer stocks of EAMs with minimal helper 
virus contamination. Preferably the EAM stocks contain helper virus representing less than 
1%. preferably less than 0.1% and most preferably less than 0.01% of the final viral isolate. 
Purified EAMs are then injected in vivo in muscles of dystrophin minus mdx mice to 
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determine whether these vectors lead to immune rejection and whether thev can alleviate 
dystrophic symptoms in mdx muscles. 

The amount of helper virus present in the EAM preparations is reduced in two ways. 
The first is by selectively controlling the relative packaging efficiency of the helper virus 
versus the EAM virus. The second is to improve physical methods for separating EAM from 
helper virus. These approaches enable the generation of dystrophin-expressing EAMs that are 
contaminated with minimal levels of helper virus. 

a) Development And Characterization Of Adenovirus Packaging 
Cell Lines Expressing The Cre Recombinase From 
Bacteriophage PI 

Cell lines expressing a range of Cre levels are used to optimize the amount of helper 
virus packaging that occurs during growth of the EAM vectors. The Cre-loxP system [Sauer 
and Henderson (1988) Proc. Natl. Acad. Sci. USA 85:5166] is employed to selectively disable 
helper \ irus packaging during growth of EAMs. The bacterial Cre recombinase catalyzes 
efficient recombination between a 34 bp target sequence called loxP. To delete a desired 
sequence. loxP sites are placed at each end of the sequence to be deleted in the same 
orientation: in the presence of Cre recombinase the intervening DNA segment is efficiently 
excised. 

Cell lines expressing a range of Cre recombinase levels are generated. The expression 
of too little Cre protein may result in high levels of helper virus being generated, which leads 
to unacceptably high levels of helper virus contaminating the final EAM preparation. If very 
high levels of Cre expression are present in a cell line, excision of the packaging signal from 
the helper virus would be 100% efficient (/.«?., it would completely prevent helper virus 
packaging). As shown in Example 6, serial passage of EAM preparations containing low 
levels of helper vims increased the titer of the EAM. Therefore, it is desirable, at least in the 
initial passages of a serial passage that some helper virus capable of being packaged is 
present. A low level of packagable helper virus may be provided by using a cell line 
expressing levels of Cre recombinase which are not high enough to achieve excision of the 
packaging signals from 100% of the helper virus; these cell lines would be used early in the 
serial passaging of the EAM stock and a cell line expressing high enough levels of Cre 
recombinanse to completely prevent helper virus packaging would be used for the final 
passage. 
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Alternatively. EAMs may be prepared using a packaging cell line that suppons high 
efficiency Cre recombinase-mediated excision of the packaging signals from the helper virus 
bv transfection of the Oe-expressing cell line with the EAM plasmid followed by infection of 

m 

these cells with /oxP-contaning helper virus using an MOl of > 1 .0. 

Human 293 cell lines that express a variety of levels of Cre recombinase are generated 
as follows. An expression vector containing the Cre coding region. pOG23l. was 
cotransfected into 293 cells along with pcDNA3 (Invitrogen); pcDNA3 contains the mo gene 
under the control of the SV40 promoter. pOG231 uses the human CMV enhancer/promoter 
[derived from CDM8 (Invitrogen)] to express a modified Cre gene [pOG231 was obtained 
from S. O'Gorman]. The modified Cre gene contained within pOG231 has had a nuclear 
localization signal inserted into the coding region to increase the efficiency of recombination. 

pOG231 was constructed as follows. A Bg/II site was introduced into the 5' Xba\ site 
of the synthethic intron of pMLSISCAT [Haung and Gorman (1990) Nucleic Acids Res. 
18:937] by linker tailing. A Bg/II site in the synthethic intron of pMLSISCAT was destroyed 
and a BamWl linker was inserted into the Pst\ site at the V end of the synthethic intron in 
pMLSISCAT. flgni and Smal sites and. a nuclear localization signal were introduced into the 
5' end of pMC-Cre [Gu *;/ al. (1993) Cell 73:1 155] using a PCR fragment that extended from 
the novel Bg/II and Smal sites to the Bamm. site in the Cre coding region. This PCR 
fragmem was ligated to a BammiSaH fragment containing a portion of the Cre coding region 
derived from pIC-Cre (Gu el ai. supra) and the intron plus Cre coding sequence %vas inserted 
into a modified form of pOG44 [O'Gorman et al. (1991) Science 251:1351] to generate 
pOG231. The predicted sequence of pOG23l from the Bg/II site to the BamYW site located in 
the middle of the Cre coding sequence is listed in SEQ ID N0:1 1. 

One 60 mm dish of 293 cells (Microbix) were transfected with 10 ^lg of PvuW- 
Unearized pOG23l and 1 ^g of JVort-linearized pcDNA3 using a standard calcium phosphate 
precipitation protocol. Two days after the addition of DNA, the transfected cells were split 
into three 100 mm dishes and 1000 jig/ml of active G418 was added to the medium. The 
cells were fed periodically with G418-contaning medium and three weeks later. 24 G418- 

resistant clones were isolated. 
I The isolated clones were expanded for testing. Aliquots were frozen in liquid nitrogen 

at the earliest possible passage. The neomycin resistant cell lines were examined for the 
expression of Cre recombinase using following transfection assay. The neomycin resistant 
cells were transfected with PGK-l-GFP-lacZ (obtained from Sally Camper. Univ. of 
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Michigan. Ann Arbon MI), which contains a green fluorescent protein (GFP) expression 
cassette that can be excised by Cre recombinase to allow expression of P-galactosidase: 
transfection was accomplised using standard calcium phosphate precipitation. Figure 18 
provides a schematic of a GFP/p-gal reporter construct suitable for assaying the expression of 
Cre recombinase in mammalian cells; GFP sequences and P-gal (/.ef., lacZ) sequences are 
avaialbale from commmerical sources {e,g,, Clonetech, Palo Alto. CA and Pharmacia Biotech, 
Piscataway, NJ. respectively). 

Control experiments verified that 293 cells transfected with PGK-l-GFP-lacZ 
expressed significant amounts of P-galactosidase only if these cells also expressed Cre 
recombinanse. P-galactosidase assays were performed as described in Example 6. Neomycin 
resistant cells expressing Cre recombinase were grouped as high, medium or low expressors 
based upon the amount of P-galactosidase activity produced (estimated by direct counting of 
P-galactosidase-positive cells per high-power field and by observing the level of staining and 
the rapidity with which the blue stain was apparent) when these cell lines were transfected 
15 with the GFP/p-gal reporter construct Thirteen positive (i.e.. Ot'-expressing) lines, D608#12. 
#7. #22. #18. #17, #4, #8, #2, #2/2, #13, #5. #15 and #21 were retained for further use. 

The results of this transfection analysis revealed that cultures of 293 cells expressing 
medium to high levels of Cre recombinase could be generated without apparent toxicity. 

-0 b) Generation Of Helper Adenovirus Strains That Contain loxP 

Sites Flanking The Adenovirus Packaging Signals 
Studies of EAM production demonstrated that the EAM vector has a packaging 
advantage over the helper adenovirus (Ex. 6). While not limiting the present invention to a 
particular mechanism, it is hypothesized that this packaging and replication advantage can be 

25 greatly increased by using helper viruses that approach the packaging size limits of Ad5 [Belt 
et ai (1993) J. Virol. 67:5911], by using viruses with mutations in E4 and/or E2 genes [Yeh 
et ai (1996) J. Virol. 70:559; Gorziglia et al. (1996) J. Virol. 70:4173; and Amalfilano et ai 
(1996) Proc. Natl. Acad. Sci. USA 93:3352], by inclusion of mutations or alterations in the 
packaging signals of the helper virus [Imler et ai (1995) Hum. Gene Ther. 6:71 1] and by 

30 combining these strategies. 

The Cre-loxP excision method is used to disable the packaging signals from the helper 
virus genomes. The Ad5 packaging domain extends from nucleotide 194 to 358 and is 
composed of five distinct elements that are functionally redundant [Hearing et ai (1987) J. 
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Virol. 61:2555]. Theoretically, any molecule containing the AdS origin ot replication and 
packaging elements should replicate and be packaged into mature virions in the presence of 
non-defective helper virus. Disabling the packaging signals should allow replication and gene 
expression to proceed, but will prevent packaging of viral DNA into infectious particles. This 
in turn should allow the ratio of EAM to helper virus to be increased greatly. 

To disable the packaging signals within the helper virus used to encapsidate the EAMs. 
the loxP sequences are incorporated into a helper virus that has a genome approaching the 
maximal packaging size for Ad [-105 map units; Bett et al. (1993). supra] which further 
decreases the efficiency of helper virus packaging. Final virus size can be adjusted by the 
choice of introns inserted into the reporter gene, by the choice of reporter genes, or by 
including a variety of DNA fragments of various sizes to act as "smffer" fragments. A 
convenient reporter gene is the alkaline phosphatase gene (see Ex. 6). The optimized Cre- 
loxP system is also incorporated into a helper viral backbone containing disruptions of any or 
all of the E1-E4 genes. Use of such deleted genomes requires viral growth on appropriate 
complementing cell lines, such as 293 cells expressing E2. and/or E4 gene products. The 
loxP sequences are incorporated into Uie helper virus by placing the loxP 
sequences on either side of the packaging signals on a shuttle vector. This modified shuttle 
vector is then used to recombine with the Ad DNA derived from a virus containing 
disruptions of any or all of the E1-E4 genes to produce the desired helper virus containing the 
packaging signals flanked by loxP sequences. 

pAdBglll. an adenovirus shuttle plasmid containing Ad sequences from O-l and 9-16 
map units (mu) was used as the starting material. Synthetic oligonucleotides were used to 
create a polylinker which was inserted at 1 mu within pAdBglU as follows. The Bglll LoxP 
oligo[5-GAAGATCTATAACTTCGTATAATGTATGCTA 

TACGA.\GTTATTACCGAAGAAATGGCTCGAGATCTTC-3- (SEQ ID N0:12) and its 
reverese complement [5'-GAAGATCTCGAGCCATTTCTTCGGTAATAACTTCGT 
ATAGC.\TACATTATACGAAGTTATAGATCTTC-3' (SEQ ID N0:13)] were synthesized. 
The Afllll LoxP oligo [5'-CCACATGTATAACTTCGTATAGCATACA 
TTATACGAAGTrATACATGTGG-3' (SEQ ID NO.H)] and its reverse complement [5'- 
CCACATGTATAACTTCGTATAATGTATGCTATACGAAGTTATACATG TGG-3' (SEQ 

ID NO: 15)1 were synthesized. The double stranded form of each loxP oligonuceotide was 
digested with the appropriate restriction enzyme {e.g., Bglll or AfllU) and inserted into 
pAdBglll which had been digested with Bglll and AfllU This resulted in the insertion of 
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LoxP sequences into the shuttle vector flanking the packaging signals that are located between 
0.5 and 1 mu of Ad5 (one mu equals 360 bp in the Ad5 genome; the sequence of the Ad5 
genome is Isited in SEQ ID N0:1). The 3' loxP sequence was inserted into the Bglll site 
within pAdBglll. The 5' loxP sequence was inserted into the AfRW site located at base 143 
(-0.8 mu) (numbering relative to Ad5). 

DNA sequencing was used to verify the final structure of the modified shuttle plasmid 
between 0-1 mu. UCre recombinase-mediated excision of the packaging signals is found to 
be too efficient (as judged by the production of too little helper virus), alternate sites for the 
insertion of the loxP sequences are used that would result in deletion of 2, 3 or 4 packaging 
sequences rather than all 5 [Grable and Hearing (1990) J. Virol. 64:2047]. The insertion of 
loxP sequences at sites along the Ad genome contained within the shuttle vector which would 
result in the deletion 2. 3 or 4 packaging sequences are easily made using the technique of 
recombinant PGR [Higuchi (1990) In: PCR Protocols: A Guide to Methods and 
Applications. Innis et ai (eds.) Academic Press. San Diego, CA, pp. 177-183]. The optimal 
amount of Cve recombinase-mediated excision of the packaging signals is that amount which 
permits the production of enough packaged helper virus to permit the slow spread of virus on 
the first plate of cells co-transfected with helper virus DNA (containing the loxP sequences) 
and the EAM vector. This permits serial passage of the EAM preparation onto a subsequent 
lawn of cells to increase the titer of the EAM preparation. Alternatively, if Cre recombinase- 
mediated excision of the packaging signals is esseritially 100% efficient in all Cre-expressing 
cells lines {i,e.. regardless of the level of Cre-expression, that is even a cell line expressing a 
low level of Cre as judged by the GFP/p-gal assay described above), the packaged EAMs 
may be used along with helper virus (used at a MOI of -1.0) to infect the second or 
subsequent lawn of cells to permit serial passaging to increase the titer of the EAM 
preparation. 

Following introduction of the loxP sequences into the shuttle vector, the human 
placental alkaline phosphatase (HpAp) cDNA under control of the RSV promoter was inserted 
into the polylinker to provide a reporter gene for the helper virus. This is the same reporter 
used previously during EAM generation (Ex. 6). The HpAp sequences were inserted as 
follows. The /f^xP-containing shuttle vector was linearized with Xho\ and the hpAp cassette 
was ligated into the Xho\ site (a Xho\ site was inserted into pAdBglll during the insertion of 
the loxP sequences as a Xho\ site was located on the 3' end of the loxP sequences inserted 
into the B^IW site of pAdBglll). The HpAp cassette was constructed as follows. 
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■pRSVhAPT40 (obtained from Gary Nabel. Univ. of Michigan. Ann Arbor. Ml) was digested 
with £c«Rl to generate an £coRl fragment containing the HpAP cDNA and the SV40 intron 
and polyadenylation sequences. pRc/RSV (Invitrogen) was digested with Hmdlll. then 
partially digested with EM. A 5.208 bp fragment was then size selected on an agarose gel. 
treated with calf alkaline phosphatase, and ligated to the £coRI fragment derived from 
pRSVhAPT40 to generate pRc/RSVAP. pRc/RSVAP was then digested with Sail and Xhol to 
liberate the RSV promoter linked to the HpAP cDNA cassette (including the SV40 intron and 
polyadenylation sequences). This Sall-Xhol fragment was inserted into the te/>-containing 
shuttle vector which had been digested with Sail and Xhol to generate pADLoxP-RSVAP. 

Helper virus containing loxP sites flanking the Ad packaging signals is generated by 
co-transfection of the LoxP shuttle plasmid (pADLoxP-RSVAP) and C/oI-digested 
Ad5c//700l DNA into 293 cells [Graham and Prevec (1991) Manipulation of Adenovirus 
Vectors. In Methods in Molecular Biology. Vol. 7: Gene Transfer and Expression Protocols. 
Murray (ed.). Humana Press Inc.. Clifton. NJ. pp. 109-128]. Figure 19 provides a schematic 
showinu the recombination event between ihc loxP shuttle vector and the Ad5 J/7001 genome. 
Co-transfection is carried out as described in Example 3c. 

Alternatively, the reporter gene may be inserted into the E3 region of pBHGlO or 
pBHGI 1 (using the unique Pad site) rather than into the polylinker located in the El region 
of the shuttle vector. The reporter gene-containing pBHGlO or 1 1 is then used in place of 
AdSJimi for cotransfection of 293 cells along with the loxP-conUining pAdBglll. 

Following cotransfection, recombinant plaques are picked, plaque purified, and tested 
for incorporation of both hpAp and loxP sequences by PCR and Southern analysis (Ex. 3). 
Viruses which contain hxP sites flanking the packaging signals and the marker gene (hpAp) 

are retained, propagated and purified. 

The isolated helper virus containing loxP sites flanking the packaging signals and the 
marker gene is then used to infect both 293 cells and the 293 cell lines that express Cre 
(section a. above). Cre recombinase-expressing cell lines that produce optimal levels of Cre 
recombinase when infected with the /oxf-containing helper virus are then used for the 
•veneration of EAMs as described below. 
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c) Generation Of Encapsidated Adenovirus Minichromosomes 
That Express Dystrophin 

The growth of EAMs is optimized using the following methods. In the first method, 
plasmid DNA from the EAM vector (Ex. 6) are co-transfected into 293 cells with purified 
viral DNA from the helper virus, and viruses are harvested 10-14 days later after appearance 
of a viral cytopathic effect (CPE) as described in Example 6. This approach is simple, yet 
potentially increases helper virus levels by allowing viral spread throughout the culture dishes. 

In the second method, the co-transfected 293 cells are overlaid with agar and single 
plaques are picked and tested for EAM activity by the ability to express (J-galactosidase 
activity following infection of Hela cells (Ex. 6). This second approach is more time 
consuming, but will result in less contamination by helper. 

The ability of these two methods to generate EAMs are directly compared. The 
preferred method is that which produces the highest titer of EAM with the lowest 
contamination of helper virus. The efficiency of EAM generation may also be increased by 
using helper virus DNA that retains the terminal protein (TP) {Lc. the helper virus is used as 
a viral DNA-TPC as described in Ex. 3) The use of viral DNA-TPC has been shown to 
increase the efficiency of viral production following transfection of 293 cells by an order of 
magnitude [Miyake et al. (1996), supra]. 

The initial transfection will utilize 293 cells that do not express Cre recombinase. so 
that efficient spread of the helper can lead to large scale production of EAM. The method 
producing the highest ratio of EAM to helper virus is then employed to optimize conditions 
for the serial propagation of the EAM on 293 cells expressing CVe recombinase. This 
optimization is conducted using cell lines expressing different levels of Cre recombinase. The 
following variables are tested: 1) the ratio of input viral titer to cell number, 2) the number 
of serial passages to use for EAM generation, 3) the use of cell lines producing different 
levels of Cre to achieve the optimal ratio between high EAM titer and low helper titer, 4) 
continuous growth on Cre-producing 293 cells versus alternating between Cre-producing cells 
and the parental 293 cells, or to alternate between high and low Cre-producing cells and 5) 
CsCl purification of EAMs prior to re-infection of 293 cells increases the ratio of the final 
EAM/helper titers. The protocols that result in the highest yield of EAM with minimal helper 
virus are used to generate large volumes of crude viral ly sates for purification by density 
gradient centrifugation (Ex. 6). 
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EAMs were purified from helper virus using standard CsCl density gradient 
centrifugation protocols in Example 6 and resulted in preparations containing -4% helper virus. 
In order to improve the physical separation of EAMs and helper virus, a variety of different 
centrifugation conditions are possible, including changing the gradient shape, type of rotor and 
tube used, combinations of step and continuous gradients, and the number of gradients used. 
Materials with better resolving powers than CsCl can also be employed. These include 
rubidium chloride and potassium bromide [Reich and Zarybnicky ( 1 979) Annal. Biochem. 
94:193]. 

d) Use Of EAMs Encoding Dystrophin For Long Term 
Expression Of Dystrophin in Muscle 

To demonstrate the ability of the purified dystrophin expressing EAMs (prepared as 
described above) to deliver and express dystrophin in the muscle of an animal, the following 
experiment is conducted. First, purified dystrophin-EAMs are delivered to the muscle by 
direct intramuscular injection into newborn, 1 month, and adult (3 month) mouse quadriceps 
muscle. The dystrophin expressing EAM AdSpDys are injected into mdx mice and into 
transgenic mclx mice that express P-galactosidase in the pituitary gland [Tripathy et al. (1996) 
Nature Med. 2:545]. These latter mice are used to avoid potential immune-rejection of cells 
expressing p-galactosidase from the AdSpDys vector. An alternate EAM lacking the P- 
galactosidase reporter gene may also be employed; however, the presence of the p-Gal 
reporter simplifies EAM growth and purification (vectors lacking p-Gal have their purity 
estimated by PCR assays rather than by P-Gal assays). 

Following intramuscular injection of EAM. animals are sacrificed at intervals between 
1 week and 6 months to measure dystrophin expression [by western blot analysis and by 
immunofluorescence (Phelps et al. (1995) Hum. Mol. Genet. 4:1251 and Rafael el al. (1996) 
J. Cell Biol. 134:93] and muscle extracts will also be assayed for p-Gal activity [MacGregor 
et al. (1991). supra]. These results are compared with previous results obtained using cunent 
generation viral vectors {i.e.. containing deletions in El and E3 only) to demonstrate that 
EAMs improve the prospects for long term gene expression in muscle. 
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e) Use Of Dystrophin-EAMs To Prevent, Halt, Or Reverse The 
Dystrophic Symptoms That Develop In The Muscles Of mdx 
Mice 

To demonstrate that beneficial effects on dystrophic muscle are achieved by delivery 
of dystrophin expressing EAMs to mdx mice, the following experiments are conducted. 
Central nuclei counts are performed on soleus muscle at intervals following injection of EAM 
into newborn. 1 month, and 3 month (adult) mice (Phelps et ai, 1995). Central nuclei arise 
in mouse muscle only after a myofiber has undergone dystrophic necrosis followed by 
regeneration, and is a quantitative measure of the degree of dystrophy that has occurred in a 
muscle group (Phelps et al, 1995). 

More informative assays are also contemplated. These assays require administration of 
EAMs to mouse diaphragm muscles. 

The diaphragm is severely affected in mdx mice (Stedman et ciL. 1991: Cox et ai. 
1993). and displays dramatic decreases in both force and power generation. Administration of 
virus to the diaphragm will allow the strength of the muscles to be measured at intervals 
following dystrophin delivery. The force and power generating assays developed by the 
Faulkner lab are used to measure the effect of dystrophin transgenes (Shrager et ai, 1992; 
Rafael et ai. 1994; Lynch et aL. 1996). 

To detemine whether dystrophin delivery to dystrophic muscle reverses dystrophy or 
stabilizes muscle, varying amounts of the dystrophin EAM are delivered to the to diaphragm 
at different stages of the dystrophic process and then strength is measured at intervals 
following EAM administration. First, various titers of EAM are tested to determine the 
minimal amount of virus needed to transduce the majority of muscle fibers in the diaphragm. 
It has been shown that conventional adenovirus vectors can transduce the majority of 
diaphragm fibers when 10 pfu are administrated by direct injection into the intraperitoneal 
cavity [Huard et al (1995) Gene Therapy 2:107]. In addition, it has been shown that 
transduction of a simple majority of fibers in a muscle group is sufficient to prevent virtually 
all the dystrophic symptoms in mice [Rafael et al. (1994) Hum. Mol. Genet. 3:1725 and 
Phelps et aL (1995) Hum. Mol. Genet. 4:1251]. Virus is administered to mdx animals at three 
different ages (neonatal 1 month, and 3 months). Animals are sacrificed for physiological 
analysis of diaphragm muscle at two different times post infection ( 1 month and 3 months). 
Error control is achieved by performing these experiments in sextuplicate. Conn-ol animals 
consist of mock injected wild-type (C57B1/10) and dystrophic (mdx) mice. Three month old 
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mdx mice display a 40% reduction in force and power generation compared with wild-type 
mice, while 6 month animals display a greater than 50% reduction (Cox et al. (1993) Nature 
364:725: Rafael et al. (1994). supra; Phelps et al. (1995). supra: and Corrado et al. (1996) J, 
Cell. Biol. 134:873]. 

EXAMPLE 8 
Improved Shuttle Vectors For The 
Production Of Helper Virus Containing LoxP Sites 



In the previous example, a shuttle vector containing adenoviral sequences extending 
from 9 mu to 16 mu of the AdS genome was modified to contain loxP sequences surrounding 
the packaging signals (the LoxP shuttle vector). This modified shuttle vector was then 
recombined with an Ad virus to produce a helper virus containing the loxP sequences. This 
helper virus was then used to infect cells expressing Cre recombinase along with DNA 
comprising a minichromosome containing the dystrophin gene, a reporter gene and the 
packaging signals and ITRs of Ad in order to preferentially package the minichromosomes. 
Using this approach the helper virus, which has had the packaging signals removed by Cre- 
loxP recombination, contains the majority of the Ad genome (only a portion of the E3 region 
is deleted). Thus, if low levels of helper virus are packaged and appear in the EAM 
preparation, the EAM preparation has the potential of passing on helper virus capable of 
directing the expression of Ad proteins in cells which are exposed to the EAM preparation. 
The expression of Ad proteins may lead to an immune response directed against the infected 

cells. 

Another approach to reducing the possibility that the EAM preparation contains helper 
5 virus capable of provoking an immune response is to use helper viruses containing deletions 
and/or mutations within the pol and pTP genes. Helper virus containing a deletion in the pol 
and/or pTP genes is cotransfected with the EAM construct into 293-derived cell lines 
expressing pol or pol and pTP to produce EAMs. Any helper virus presem in the purified 
EAM preparation will be replication defective due to the deletion in the pol and/or pTP genes. 
0 As shown in Example 3, viruses containing a deletion in the pol gene are incapable of 

directing the expression of viral late genes: therefore, helper viruses containing a deletion in 
■ the pol gene or the pol and pTP genes should not be capable of provoking an immune 
response (j.c.. a CTL response) against late viral proteins synthesized de novo. Shuttie 
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vectors contain deletions within the Ad pol and/or pTP genes are constructed as described 
below. 

a) Construction Of A Shuttle Vector Containing The Apol 
Deletion 



pAdBglll was modified to contain sequences corresponding to 9 to 40 mu of the Ad5 
genome as follows. pAdBglll was digested with Bglll and a linker/adapter containing an Ascl 
site was added to create pAdBglllAsc. pAdBglllAsc was then digested with 5^/1 1071. 
Ad5i//7001 viral DNA was digested with Ascl and the ends were filled in using T4 DNA 
polymerase. The ^irl-digested. T4 DNA polymerase filled Ad5^/7001 viral DNA was then 
10 digested with 5^/1 1071 and the ^9.9 kb AscUBstl 107I(filIed) fragment containing the pol and 
pTP genes was isolated (as described in Ex. 3) and ligated to the Bsil 1071-digested 
pAdBglllAsc to generate pAdAsc. pAdAsc is a shuttle vector which contains the genes 
encoding the DNA polymerase and preterminal protein (the inserted >l.ycl-/?.i/l 1071 fragment 
corresponds to nucleotides 5767 to 15. 671 of the Ad5 genome). 
•5 A shuttle vector. pAdAscApoU which contains the 612 bp deletion in the pol gene 

(described in Ex. 3) was constructed as follows. Ad5ApolAE3I viral DNA (Ex. 3) was 
digested with Ascl and the ends were filled in using T4 DNA polymerase. The y45d-digested, 
T4 DNA polymerase filled Ad5ApolAE3I viral DNA was then digested with fl.9/1107I and the 
-9.3 kb Ascl-Bst I l07l(Med) fragment containing the deleted pol gene and the pTP gene was 
20 isolated (as described in Ex. 3) and ligated to Bstl 1071-digested pAdBglllAsc to generate 
pAdAscApol. 

b) Construction Of A Shuttle Vector Containing The ApolApTP 
Deletion 

A shuttle vector, pAdAscApolApTP, which contains a 2.3 kb deletion within the pol 
and pTP genes (described in Ex. 5) was constructed as follows. 

pAXBApolApTPVARNA+tl3 (Ex, 5b) was digested with Ascl and the ends were filled in 
using T4 DNA polymerase. The ^icl-digested, T4 DNA polymerase filled 
pAXBApolApTPVARNA+tl3 DNA was then digested with Bsil 1071 and the --7.6 kb ^^cl- 
Bstl 107I(filIed) fragment containing the deleted pol gene and the pTP gene was isolated (as 
0 described in Ex. 3) and ligated to Bsi\ 1071-digested pAdBglllAsc to generate 
pAdAscApolApTP. 

In order to reduce the packaging of the above helper viruses, the pol" or pol", pTP" 
helper \ iruses can be modified to incorporate loxP sequences on either side of the packaging 
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signals as outlined in Example 7. The toP-containing pol' or pol". pTP~ shuttle vectors 
(pTP- shuttle vectors may also be employed) are cotransfected into 293 cells expressing Cre 
recombinase and pol or Cre recombinase. pol and pTP. respectively along with an appropriate 
El" viral DNA-TPC (the El- viral DNA may also contain deletions elsewhere in the genome 
such as in the E4 genes or in the E2a gene as packaging cell lines expressing the E4 ORF 6 
or 6/7 and lines expressing E2a genes are avaialble) to generate helper virus containing loxP 
sites flanking the packaging signals as well as a deletion in the pol gene or the pol and pTP 
genes. The resulting helper virus(es) is used to cotransfect 293 cells expressing pol or pol 
and pTP along with the desired EAM construct. The resulting EAM preparation should 
contain little if any helper virus and any contaminating helper virus present would be 
replication defective and incapable of expressing viral late gene products. Helper viruses 
containing loxP sequences and deletions in all essential early genes may be employed in 
conjunction with Cre recombinanse-expressing cell lines expressing in trans the El. E4 ORF 
6. E2a. and E2b (e.g.. Ad polymerase and pTP) proteins (the E3 proteins are dispensible for 
growth in culture). Cell lines coexpressing El, polymerase and pTP are provided herein. 
Cell lines expressing El and E4 proteins have been recently described [Krougliak and Graham 
(1995) Hum. Gene Ther. 6:1575 and Wang et ai (1995) Gene Ther. 2:775] and cell lines 
expressing El and E2a proteins have been recently described [Zhou et al (1996) J. Virol. 
70:7030]. Therefore, a cell line co-expressing El, E2a, E2b, and E4 is consructed by 
introduction of expression plasmids containing the E2a and E4 coding regions into the E1-. 
Ad polymerase- and pTP-expressing cell lines of the present invention. These packaging cell 
lines are used in conjunction with helper viruses containing deletions in the El. E2a. E2b and 
E4 regions. 

All publications and patents mentioned in the above specification are herein 
incorporated by reference. Various modifications and variations of the described method and 
system of the invention will be apparent to those skilled in the art without departing from the 
scope and spirit of the invention. Although the invention has been described in connection 
with specific preferred embodiments, it should be understood that the invention as claimed 
should not be unduly limited to such specific embodiments. Indeed, various modifications of 
the described modes for carrying out the invention which are obvious to those skilled in 
molecular biology or related fields are intended to be within the scope of the following 
claims. 
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SEQUENCE LISTING 

(X) GENERAL INFORMATION: 

(i) APPLICANT: Chamberlain, Jeffrey S. 

Amalfitano, Andrea 
Hauser, Michael A. 
Kumar -Si ngh , Raj endr a 
Hartigan-0' Connor, Dennis J. 

(ii) TITLE OF INVENTION: IMPROVED ADENOVIRUS VECTORS 

(iii) NUMBER OF SEQUENCES: 15 

(iv) CORRESPONDENCE ADDRESS: 

(A) ADDRESSEE: Median & Carroll, LLP 

(B) STREET: 220 Montgomery Street, Suite 2200 

(C) CITY: San Francisco 

(D) STATE: California 

<E) COUNTRY: United States Of America 
(F) ZIP: 94104 

(v) COMPUTER READABLE FORM: 

U) MEDIUM TYPE: Floppy disk 

(B) COMPUTER: IBM PC compatible 

(C) OPERATING SYSTEM: PC -DOS /MS -DOS 

(D) SOFTWARE: Patent In Release #1.0, Version #1.30 

(vi) CURRENT APPLICATION DATA: 

(A) APPLICATION NUMBER: US 

(B) FILING DATE: 

(C) CLASSIFICATION: 

(viii) ATTORNEY/AGENT INFORMATION: 
<A) NAME: Ingolia, Diane E. 

(B) REGISTRATION NUMBER: 40,027 

(C) REFERENCE/DOCKET NUMBER: UM-02484 

(ix) TELECOMMUNICATION INFORMATION: 

(A) TELEPHONE: (415) 705-8410 

(B) TELEFAX: (415) 397-8338 

(2) INFORMATION FOR SEQ ID NO:l: 

(i) SEQUENCE CHARACTERISTICS; 

(A) LENGTH: 3 5 935 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: other nucleic acid 
(A) DESCRIPTION: /desc = "DNA" 



(xi) SEQUENCE DESCRIPTION: SEQ ID N0:1: 



CATCATCAAT 


AATATACCTT 


ATTTTGGATT 


GAAGCCAATA 


TGATAATGAG 


GGGGT6GAGT 


60 


TTGTGACGTG 


GCGCGGGGCG 


TGGGAACGGG 


GCGGGTGACG 


TAGTAGTGTG 


GCGGAAGTGT 


120 


GATGTTGCAA 


GTGTGGCGGA 


ACACATGTAA 


GCGACGGATG 


TGGCAAAAGT 


GACGTTTTTG 


180 


GTGTGCGCCG 


GTGTACACAG 


GAAGTGACAA 


TTTTCGCGCG 


GTTTTAGGCG 


GATGTTGTAG 


240 


TAAATTTGGG 


CGTAACCGAG 


TAAGATTTGG 


CCATTTTCGC 


GGGAAAACTG 


AATAAGAGGA 


300 


AGTGAAATCT 


GAATAATTTT 


GTGTTACTCA 


TAGCGCGTAA 


TATTTGTCTA 


GGGCCGCGGG 


360 


GACTTTGACC 


GTTTACGTGG 


AGACTCGCCC 


AGGTGTTTTT 


CTCAGGTGTT 


TTCCGCGTTC 


420 


CGGGTCAAAG 


TTGGCGTTTT 


ATTATTATAG 


TCAGCTGACG 


TGTAGTGTAT 


TTATACCCGG 


480 
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TGAGTTCCTC AAGAGGCCAC TCTTGAGTGC CAGCGAGTAG AGTTTTCTCC TCCGAGCCGC 540 
TCCGACACCG GGACTGAAAA TGAGACATAT TATCTGCCAC GGAGGTGTTA TTACCGAAGA 600 



AATGGCCGCC AGTCTTTTGG ACCAGCTGAT CGAAGAGGTA CTGGCTGATA ATCTTCCACC 
TCCTAGCCAT TTTGAACCAC CTACCCTTCA CGAACTGTAT GATTTAGACG TGACGGCCCC 
CGAAGATCCC AACGAGGAGG CGGTTTCGCA GATTTTTCCC GACTCTGTAA TGTTGGCGGT 
GCAGGAAGGG ATTGACTTAC TCACTTTTCC GCCGGCGCCC GGTTCTCCGG AGCCGCCTCA 
CCTTTCCCGG CAGCCCGAGC A6CCGGAGCA GAGAGCCTTG GGTCCGGTTT CTATGCCAAA 
CCTTGTACCG GAGGTGATCG ATCTTACCTG CCACGAGGCT GGCTTTCCAC CCAGTGACGA 
CGAGGATGAA GAGGGTGAGG AGTTTGTGTT AGATTATGTG GAGCACCCCG GGCACGGTTG 
CAGGTCTTGT CATTATCACC GGAGGAATAC GGGGGACCCA GATATTATGT GTTCGCTTTG 
CTATATGAGG ACCTGTGGCA TGTTTGTCTA CAGTAAGTGA AAATTATGGG CAGTGGGTGA 
TAGAGTGGTG GGTTTGGTGT GGTAATTTTT TTTTTAATTT TTACAGTTTT GTGGTTTAAA 
GAATTTTGTA TTGTGATTTT TTTAAAAGGT CCTGTGTCTG AACCTGAGCC TGAGCCCGAG 
CCAGAACCGG AGCCTGCAAG ACCTACCCGC CGTCCTAAAA TGGCGCCTGC TATCCTGAGA 
CGCCCGACAT CACCTGTGTC TAGAGAATGC AATAGTAGTA CGGATAGCTG TGACTCCGGT 
CCTTCTAACA CACCTCCTGA GATACACCCG GTGGTCCCGC TGTGCCCCAT TAAACCAGTT 
GCCGTGAGAG TTGGTGGGCG TCGCCAGGCT GTGGAATGTA TCGAGGACTT GCTTAACGAG 
CCTGGGCAAC CTTTGGACTT GAGCTGTAAA CGCCCCAGGC CATAAGGTGT AAACCTGTGA 
TTGCGTGTGT GGTTAACGCC TTTGTTTGCT GAATGAGTTG ATGTAAGTTT AATAAAGGGT 
GAGATAATGT TTAACTTGCA TGGCGTGTTA AATGGGGCGG GGCTTAAAGG GTATATAATG 
CGCCGTGGGC TAATCTTGGT TACATCTGAC CTCATGGAGG CTTGGGAGTG TTTGGAAGAT 
TTTTCTGCTG TGCGTAACTT GCTGGAACAG AGCTCTAACA GTACCTCTTG GTTTTGGAGG 
TTTCTGTGGG GCTCATCCCA GGCAAAGTTA GTCTGCAGAA TTAAGGAGGA TTACAAGTGG 
GAATTTGAAG AGCTTTTGAA ATCCTGTGGT GAGCTGTTTG ATTCTTTGAA TCTGGGTCAC 
CAGGCGCTTT TCCAAGAGAA GGTCATCAAG ACTTTGGATT TTTCCACACC GGGGCGCGCT 
GCGGCTGCTG TTGCTTTTTT GA6TTTTATA AAGGATAAAT GGAGCGAAGA AACCCATCTG 
AGCGGGGGGT ACCTGCTGGA TTTTCTGGCC ATGCATCTGT GGAGAGCGGT TGTGAGACAC 
AAGAATCGCC TGCTACTGTT GTCTTCCGTC CGCCCGGCGA TAATACCGAC GGAGGAGCAG 
CAGCAGCAGC AGGAGGAAGC CAGGCGGCGG CGGCAGGAGC AGAGCCCATG GAACCCGAGA 
GCCGGCCTGG ACCCTCGGGA ATGAATGTTG TACAGGTGGC TGAACTGTAT CCAGAACTGA 
GACGCATTTT GACAATTACA GAGGATGGGC AGGGGCTAAA GGGGGTAAAG AGGGAGCGGG 
GGGCTTGTGA GGCTACAGAG GAGGCTAGGA ATCTAGCTTT TAGCTTAATG ACCAGACACC 
GTCCTGAGTG TATTACTTTT CAACAGATCA AGGATAATTG CGCTAATGAG CTTGATCTGC 
TGGCGCAGAA GTATTCCATA GAGCAGCTGA CCACTTACTG GCTGCAGCCA GGGGATGATT 
TTGAGGAGGC TATTAGGGTA TATGCAAAGG TGGCACTTAG GCCAGATTGC AAGTACAAGA 



660 
720 
780 
840 
900 
960 

1020 

1080 

1140 

1200 

1260 

1320 

1380 

1440 

1500 

1560 

1620 

1680 

1740 

1800 

1860 

1920 

1980 

2040 

2100 

2160 

2220 

2280 

2340 

2400 

2460 

2520 

2580 
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TCAGCAAACT 


TGTAAATATC 


AGGAATTGTT 


GCTACATTTC 


TGGGAACGGG 


GCCGAGGTGG 


2640 


AGATAGATAC 


GGAGGATAGG 


GTGGCCTTTA 


GATGTAGCAT 


GATAAATATG 


TGGCCGGGGG 


"2700 


TGCTTGGCAT 


GGACGGGGTG 


GTTATTATGA 


ATGTAAGGTT 


TACTGGCCCC 


AATTTTAGCG 


2760 


GTACGGTTTT 


CCTGGCCAAT 


ACCAACCTTA 


TCCTACACGG 


TGTAAGCTTC 


TATGGGTTTA 


2820 


ACAATACCTG 


TGTGGAAGCC 


TGGACCGATG 


TAAGGGTTCG 


GGGCTGTGCC 


TTTTACTGCT 


2880 


GCTGGAAGGG 


GGTGGTGTGT 


CGCCCCAAAA 


GCAGGGCTTC 


AATTAAGAAA 


TGCCTCTTTG 


2940 


AAAGGTGTAC 


CTTGGGTATC 


CTGTCTGAGG 


GTAACTCCAG 


GGTGCGCCAC 


AATGTGGCCT 


3000 


CCGACTGTGG 


TTGCTTCATG 


CTAGTGAAAA 


GCGTGGCTGT 


GATTAAGCAT 


AACATGGTAT 


3060 


GTGGCAACTG 


CGAGGACAGG 


GCCTCTCAGA 


TGCTGACCTG 


CTCGGACGGC 


AACTGTCACC 


3120 


TGCTGAAGAC 


CATTCACGTA 


GCCAGCCACT 


CTCGCAAGGC 


CTGGCCAGTG 


TTTGAGCATA 


3180 


ACATACTGAC 


CCGCTGTTCC 


TTGCATTTGG 


GTAACAGGAG 


GGGGGTGTTC 


CTACCTTACC 


3240 


AATGCAATTT 


GAGTCACACT 


AAGATATTGC 


TTGAGCCCGA 


GAGCATGTCC 


AAGGTGAACC 


3300 


TGAACGGGGT 


GTTTGACATG 


ACCATGAAGA 


TCTGGAAGGT 


GCTGAGGTAC 


GATGAGACCC 


3360 


GCACCAGGTG 


CAGACCCTGC 


GAGTGTGGCG 


GTAAACATAT 


TAGGAACCAG 


CCTGTGATGC 


3420 


TGGATGTGAC 


CGAGGAGCTG 


AGGCCCGATC 


ACTTGGTGCT 


GGCCTGCACC 


CGCGCTGAGT 


3480 


TTGGCTCTAG 


CGATGAAGAT 


ACAGATTGAG 


GTACTGAAAT 


GTGTGGGCGT 


GGCTTAAGGG 


3540 


TGGGAAAGAA 


TATATAAGGT 


GGGGGTCTTA 


TGTAGTTTTG 


TATCTGTTTT 


GCAGCAGCCG 


3600 


CCGCCGCCAT 


GAGCACCAAC 


TCGTTTGATG 

• 


GAAGCATTGT 


GAGCTCATAT 


TTGACAACGC 


3660 


GCATGCCCCC 


ATGGGCCGGG 


GTGCGTCAGA 


ATGTGATGGG 


CTCCAGCATT 


GATGGTCGCC 


3720 


CCGTCCTGCC 


CGCAAACTCT 


ACTACCTTGA 


CCTACGAGAC 


CGTGTCTGGA 


ACGCCGTTGG 


3780 


AGACTGCAGC 


CTCCGCCGCC 


GCTTCAGCCG 


CTGCAGCCAC 


CGCCCGCGGG 


ATTGTGACTG 


3840 


ACTTTGCTTT 


CCTGAGCCCG 


CTTGCAAGCA 


GTGCAGCTTC 


CCGTTCATCC 


GCCCGCGATG 


3900 


ACAAGTTGAC 


GGCTCTTTTG 


GCACAATTGG 


ATTCTTTGAC 


CCGGGAACTT 


AATGTCGTTT 


3960 


CTCAGCAGCT 


GTTGGATCTG 


CGCCAGCAGG 


TTTCTGCCCT 


GAAGGCTTCC 


TCCCCTCCCA 


4020 


ATGCGGTTTA 


AAACATAAAT 


AAAAAACCAG 


ACTCTGTTTG 


GATTTGGATC 


AAGCAAGTGT 


4080 


CTTGCTGTCT 


TTATTTAGGG 


GTTTTGCGCG 


CGCGGTAGGC 


CCGGGACCAG 


CGGTCTCGGT 


4140 


CGTTGAGGGT 


CCTGTGTATT 


TTTTCCAGGA 


CGTGGTAAAG 


GTGACTCTGG 


ATGTTCAGAT 


4200 


ACATGGGCAT 


AAGCCCGTCT 


CTGGGGTGGA 


GGTAGCACCA 


CTGCAGAGCT 


TCATGCTGCG 


4260 


GGGTGGTGTT 


GTAGATGATC 


CAGTCGTAGC 


AGGAGCGCTG 


GGCGTGGTGC 


CTAAAAATGT 


4320 


CTTTCAGTAG 


CAAGCTGATT 


GCCAGGGGCA 


GGCCCTTGGT 


GTAAGTGTTT 


ACAAAGCGGT 


4380 


TAAGCTGGGA 


TGGGTGCATA 


CGTGGGGATA 


TGAGATGCAT 


CTTGGACTGT 


ATTTTTAGGT 


4440 


TGGCTATGTT 


CCCAGCCATA 


TCCCTCCGGG 


GATTCATGTT 


GTGCAGAACC 


ACCAGCACAG 


4500 


TGTATCCGGT 


GCACTTGGGA 


AATTTGTCAT 


GTAGCTTAGA 


AGGAAATGCG 


TGGAAGAACT 


4560 


TGGAGACGCC 


CTTGTGACCT 


CCAAGATTTT 


CCATGCATTC 


GTCCATAATG 


ATGGCAATGG 


4620 


GCCCACGGGC 


GGCGGCCTGG 


GCGAAGATAT 


TTCTGGGATC 


ACTAACGTCA 


TAGTTGTGTT 


4680 
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CCAGGATGAG ATCGTCATAG GCCATTTTTA CAAAGCGCGG GCGGAGGGTG CCAGACTGCG 4740 

GTATAATGGT TCCATCCGGC CCAGGGGCGT AGTTACCCTC ACAGATTTGC ATTTCCCACG 4800 

CTTTGAGTTC AGATGGGGGG ATCATGTCTA CCTGCGGGGC GATGAAGAAA ACGGTTTCCG 4860 

GGGTAGGGGA GATCAGCTGG GAAGAAAGCA GGTTCCTGAG CAGCTGCGAC TTACCGCAGC 4920 

CGGTGGGCCC GTAAATCACA CCTATTACCG GGTGCAACTG GTAGTTAAGA GAGCTGCAGC 4980 

TGCCGTCATC CCTGAGCAGG GGGGCCACTT CGTTAAGCAT GTCCCTGACT CGCATGTTTT 5040 

CCCTGACCAA ATCCGCCAGA AGGCGCTCGC CGCCCAGCGA TAGCAGTTCT TGCAAGGAAG 5100 

CAAAGTTTTT CAACGGTTTG AGACCGTCCG CCGTAGGCAT GCTTTTGAGC GTTTGACCAA 5160 

GCAGTTCCAG GCGGTCCCAC AGCTCGGTCA CCTGCTCTAC GGCATCTCGA TCCAGCATAT 5220 

CTCCTCGTTT CGCGGGTTGG GGCGGCTTTC GCTGTACGGC AGTAGTCGGT GCTCGTCCAG 5280 

ACGGGCCAGG GTCATGTCTT TCCACGGGCG CAGGGTCCTC GTCAGCGTAG TCTGGGTCAC 5340 

GGTGAAGGGG TGCGCTCCGG GCTGCGCGCT GGCCAGGGTG CGCTTGAGGC TGGTCCTGCT 5400 

GGTGCTGAAG CGCTGCCGGT CTTCGCCCTG CGCGTCGGCC AGGTAGCATT TGACCATGGT 5460 

GTCATAGTCC AGCCCCTCCG CGGCGTGGCC CTTGGCGCGC AGCTTGCCCT TGGAGGAGGC 5520 

GCCGCACGAG GGGCAGTGCA GACTTTTGAG GGCGTAGAGC TTGGGCGCGA GAAATACCGA 5580 

TTCCGGGGAG TAGGCATCCG CGCCGCAGGC CCCGCAGACG GTCTCGCATT CCACGAGCCA 5640 

GGTGAGCTCT GGCCGTTCGG GGTCAAAAAC CAGGTTTCCC CCATGCTTTT TGATGCGTTT 5700 

CTTACCTCTG GTTTCCATGA GCCGGTGTCC ACGCTCGGTG ACGAAAAGGC TGTCCGTGTC 5760 

CCCGTATACA GACTTGAGAG GCCTGTCCTC GAGCGGTGTT CCGCGGTCCT CCTCGTATAG 5820 

AAACTCGGAC CACTCTGAGA CAAAGGCTCG CGTCCAGGCC AGCACGAAGG AGGCTAAGTG 5880 

GGAGGGGTAG CGGTCGTTGT CCACTAGGGG GTCCACTCGC TCCAGGGTGT GAAGACACAT 5940 

GTCGCCCTCT TCGGCATCAA GGAAGGTGAT TGGTTTGTAG GTGTAGGCCA CGTGACCGGG 6000 

TGTTCCTGAA GGGGGGCTAT T^AAAGGGGGT GGGGGCGCGT TCGTCCTCAC TCTCTTCCGC 6060 

ATCGCTGTCT GCGAGGGCCA GCTGTTGGGG TGAGTACTCC CTCTGAAAAG CGGGCATGAC 6120 

TTCTGCGCTA AGATTGTCAG TTTCCAAAAA CGAGGAGGAT TTGATATTCA CCTGGCCCGC 6180 

GGTGATGCCT TTGAGGGTGG CCGCATCCAT CTGGTCAGAA AAGACAATCT TTTTGTTGTC 6240 

AAGCTTGGTG GCAAACGACC CGTAGAGGGC GTTGGACAGC AACTTGGCGA TGGAGCGCAG 6300 

GGTTTGGTTT TTGTCGCGAT CGGCGCGCTC CTTGGCCGCG ATGTTTAGCT GCACGTATTC 6360 

GCGCGCAACG CACCGCCATT CGGGAAAGAC GGTGGTGCGC TCGTCGGGCA CCAGGTGCAC 6420 

GCGCCAACCG CGGTTGTGCA GGGTGACAAG GTCAACGCTG GTGGCTACCT CTCCGCGTAG 6480 

GCGCTCGTTG GTCCAGCAGA GGCGGCCGCC CTTGCGCGAG CAGAATGGCG GTAGGGGGTC 6540 

TAGCTGCGTC TCGTCCGGGG GGTCTGCGTC CACGGTAAAG ACCCCGGGCA GCAGGCGCGC 6600 

GTCGAAGTAG TCTATCTTGC ATCCTTGCAA GTCTAGCGCC TGCTGCCATG CGCGGGCGGC 6660 

AAGCGCGCGC TCGTATGGGT TGAGTGGGGG ACCCCATGGC ATGGGGTGGG TGAGCGCGGA 6720 

GGCGTACATG CCGCAAATGT CGTAAACGTA GAGGGGCTCT CTGAGTATTC CAAGATATGT 6780 
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AGGGTAGCAT 


CTTCCACCGC 


GGATGCTGGC 


GCGCACGTAA 


TCGTATAGTT 


CGTGCGAGGG 


6840 


AGCGAGGAGG 


TCGGGACCGA 


GGTTGCTACG 


GGCGGGCTGC 


TCTGCTCGGA 


AGACTATCTG 


6900 


CCTGAAGATG 


GCATGTGAGT 


TGGATGATAT 


GGTTGGACGC 


TGGAAGACGT 


TGAAGCTGGC 


6960 


GTCTGTGAGA 


CCTACCGCGT 


CACGCACGAA 


GGAGGCGTAG 


GAGTCGCGCA 


GCTTGTTGAC 


7020 


CAGCTCGGCG 


GTGACCTGCA 


CGTCTAGGGC 


GCAGTAGTCC 


AGGGTTTCCT 


TGATGATGTC 


7080 


ATACTTATCC 


TGTCCCTTTT 


TTTTCCACAG 


CTCGCGGTTG 


AGGACAAACT 


CTTCGCGGTC 


7140 


TTTCCAGTAC 


TCTTGGATCG 


GAAACCCGTC 


GGCCTCCGAA 


CGGTAAGAGC 


CTAGCATGTA 


7200 


GAACTGGTTG 


ACGGCCTGGT 


AGGCGCAGCA 


TCCCTTTTCT 


ACGGGTAGCG 


CGTATGCCTG 


7260 


CGCGGCCTTC 


CGGAGC6AGG 


TGTGGGTGAG 


CGCAAAGGTG 


TCCCTGACCA TGACTTTGAG 


7320 


GTACTGGTAT 


TTGAAGTCAG 


TGTCGTCGCA 


TCCGCCCTGC 


TCCCAGAGCA AAAAGTCCGT 


7380 


GCGCTTTTTG 


GAACGCGGAT 


TTGGCAGGGC 


GAAGGTGACA 


TCGTTGAAGA 


GTATCTTTCC 


7440 


CGCGCGAGGC 


ATAAAGTTGC 


GTGTGATGCG 


GAAGGGTCCC 


GGCACCTCGG 


AACGGTTGTT 


7500 


AATTACCTGG 


GCGGCGAGCA 


CGATCTCGTC 


AAAGCCGTTG 


ATGTTGTGGC 


CCACAATGTA 


7560 


AAGTTCCAAG 


AAGCGCGGGA 


TGCCCTTGAT 


GGAAGGCAAT 


TTTTTAAGTT 


CCTCGTAGGT 


7620 


GAGCTCTTCA 


GGGGAGCTGA 


GCCCGTGCTC 


TGAAAGGGCC 


CAGTCTGCAA GATGAGGGTT 


7680 


GGAAGCGACG 


AATGAGCTCC 


ACAGGTCACG 


GGCCATTAGC 


ATTTGCAGGT 


GGTCGCGAAA 


7740 


GGTCCTAAAC 


TGGCGACCTA 


TGGCCATTTT 


TTCTGGGGTG 


ATGCAGTAGA 


AGGTAAGCGG 


7800 


GTCTTGTTCC 


CAGCGGTCCC 


ATCCAAGGTT 


CGCGGCTAGG 


TCTCGCGCGG 


CAGTCACTAG 


7860 


AGGCTCATCT 


CCGCCGAACT 


TCATGACCAG 


CATGAAGGGC 


ACGAGCTGCT 


TCCCAAAGGC 


7920 


CCCCATCCAA 


GTATAGGTCT 


CTACATCGTA 


GGTGACAAAG 


AGACGCTCGG 


TGCGAGGATG 


7980 


CGAGCCGATC 


GGGAAGAACT 


GGATCTCCCG 


CCACCAATTG 


GAGGA6TGGC 


TATTGATGTG 


8040 


GTGAAAGTAG 


AAGTCCCTGC 


GACGGGCCGA 


ACACTCGTGC 


TGGCTTTTGT 


AAAAACGTGC 


8100 


GCAGTACTGG 


CAGCGGTGCA 


CGGGCTGTAC 


ATCCTGCACG 


AGGTTGACCT 


GACGACCGCG 


8160 


CACAAGGAAG 


CAGAGTGGGA 


ATTTGAGCCC 


CTCGCCTGGC 


GGGTTTGGCT 


GGTGGTCTTC 


8220 


TACTTCGGCT 


GCTTGTCCTT 


GACCGTCTGG 


CTGCTCGAGG 


GGAGTTACGG 


TGGATCGGAC 


8280 


CACCACGCCG 


CGCGAGCCCA 


AAGTCCAGAT 


GTCCGCGCGC 


GGCGGTCGGA 


GCTTGATGAC 


8340 


AACATCGCGC 


AGATGGGAGC 


TGTCCATGGT 


CTGGAGCTCC 


CGCGGCGTCA 


GGTCAGGCGG 


8400 


GAGCTCCTGC 


AGGTTTACCT 


CGCATAGACG 


GGTCAGGGCG 


CGGGCTAGAT 


CCAGGTGATA 


8460 


CCTAATTTCC 


AGGGGCTGGT 


TGGTGGCGGC 


GTCGATGGCT 


TGCAAGAGGC 


CGCATCCCCG 


8520 


CGGCGCGACT 


ACGGTACCGC 


GCGGCGGGCG 


GTGGGCCGCG 


GGGGTGTCCT 


TGGATGATGC 


8580 


ATCTAAAAGC 


GGTGACGCGG 


GCGAGCCCCC 


GGAGGTAGGG 


GGGGCTCCGG 


ACCCGCCGGG 


8640 


AGAGGGGGCA 


GGGGCACGTC 


GGCGCCGCGC 


GCGGGCAGGA 


GCTGGTGCTG 


CGCGCGTAGG 


8700 


TTGCTGGCGA 


ACGCGACGAC 


GCGGCGGTTG 


ATCTCCTGAA 


TCTGGCGCCT 


CTGCGTGAAG 


8760 


ACGACGGGCC 


CGGTGAGCTT 


GAGCCTGAAA 


GAGAGTTCGA 


CAGAATCAAT 


TTCGGTGTCG 


8820 


TTGACGGCGG 


CCTGGCGCAA 


AATCTCCTGC 


ACGTCTCCTG 


AGTTGTCTTG 


ATAGGCGATC 


8880 
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TCGGCCATGA ACTGCTCGAT CTCTTCCTCC TGGAGATCTC CGCGTCCGGC TCGCTCCACG 8940 

GTGGCGGCGA GGTCGTTGGA AATGCGGGCC ATGAGCTGCG AGAAGGCGTT GAGGCCTCCC 9000 

TCGTTCCAGA CGCGGCTGTA GACCACGCCC CCTTCGGCAT CGCGGGCGCG CATGACCACC 9060 

TGCGCGAGAT TGAGCTCCAC GTGCCGGGCG AAGACGGCGT AGTTTCGCAG GCGCTGAAAG 9120 

AGGTAGTTGA GGGTGGTGGC GGTGTGTTCT GCCACGAAGA AGTACATAAC CCAGCGTCGC 9180 

AACGTGGATT CGTTGATATC CCCCAAGGCC TCAAGGCGCT CCATGGCCTC GTAGAAGTCC 9240 

ACGGCGAAGT TGAAAAACTG GGAGTTGCGC GCCGACACGG TTAACTCCTC CTCCAGAAGA 9300 

CGGATGAGCT CGGCGACAGT GTCGCGCACC TCGCGCTCAA AGGCTACAGG GGCCTCTTCT 9360 

TCTTCTTCAA TCTCCTCTTC CATAAGGGCC TCCCCTTCTT CTTCTTCTGG CGGCGGTGGG 9420 

GGAGGGGGGA CACGGCGGCG ACGACGGCGC ACCGGGAGGC GGTCGACAAA GCGCTCGATC 9480 

ATCTCCCCGC GGCGACGGCG CATGGTCTCG GTGACGGCGC GGCCGTTCTC GCGGGGGCGC 9540 

AGTTGGAAGA CGCCGCCCGT CATGTCCCGG TTATGGGTTG GCGGGGGGCT GCCATGCGGC 9600 

AGGGATACGG CGCTAACGAT GCATCTCAAC AATTGTTGTG TAGGTACTCC GCCGCCGAGG 9660 

GACCTGAGCG AGTCCGCATC GACCGGATCG GAAAACCTCT CGAGAAAGGC GTCTAACCAG 9720 

TCACAGTCGC AAGGTAGGCT GAGCACCGTG GCGGGCGGCA GCGGGCGGCG GTCGGGGTTG 9780 

TTTCTGGCGG AGGTGCTGCT GATGATGTAA TTAAAGTAGG CGGTCTTGAG ACGGCGGATG 9840 

GTCGACAGAA GCACCATGTC CTTGGGTCCG GCCTGCTGAA TGCGCAGGCG GTCGGCCATG 9900 

CCCCAGGCTT CGTTTTGACA TCGGCGCAGG TCTTTGTAGT AGTCTTGCAT GAGCCTTTCT 9960 

ACCGGCACTT CTTCTTCTCC TTCCTCTTGT CCTGCATCTC TTGCATCTAT CGCTGCGGCG 10020 

GCGGCGGAGT TTGGCCGTAG GTGGCGCCCT CTTCCTCCCA TGCGTGTGAC CCCGAAGCCC 10080 

CTCATCGGCT GAAGCAGGGC TAGGTCGGCG ACAACGCGCT CGGCTAATAT GGCCTGCTGC 10140 

ACCTGCGTGA GGGTAGACTG GAAGTCATCC ATGTCCACAA AGCGGTGGTA TGCGCCCGTG 10200 

TTGATGGTGT AAGTGCAGTT GGCCATAACG GACCAGTTAA CGGTCTG6TG ACCCGGCTGC 10260 

GAGAGCTCGG TGTACCTGAG ACGCGAGTAA GCCCTCGAGT CAAATACGTA GTCGTTGCAA 10320 

GTCCGCACCA GGTACTGGTA TCCCACCAAA AAGTGCGGCG GCGGCTGGCG GTAGAGGGGC 10380 

CAGCGTAGGG TGGCCGGGGC TCCGGGGGCG AGATCTTCCA ACATAAGGCG ATGATATCCG 10440 

TAGATGTACC TGGACATCCA GGTGATGCCG GCGGCGGTGG TGGAGGCGCG CGGAAAGTCG 10500 

CGGACGCGGT TCCAGATGTT GCGCAGCGGC AAAAAGTGCT CCATGGTCGG GACGCTCTGG 10560 

CCGGTCAGGC GCGCGCAATC GTTGACGCTC TAGACCGTGC AAAAGGAGAG CCTGTAAGCG 10620 

GGCACTCTTC CGTGGTCTGG TGGATAAATT CGCAAGGGTA TCATGGCGGA CGACCGGGGT 10680 

TCGAGCCCCG TATCCGGCCG TCCGCCGTGA TCCATGCGGT TACCGCCCGC GTGTCGAACC 10740 

CAGGTGTGCG ACGTCAGACA ACGGGGGAGT GCTCCTTTTG GCTTCCTTCC AGGCGCGGCG 10800 

GCTGCTGCGC TAGCTTTTTT GGCCACTGGC CGCGCGCAGC GTAAGCGGTT AGGCTGGAAA 10860 

GCGAAAGCAT TAAGTGGCTC GCTCCCTGTA GCCGGAGGGT TATTTTCCAA GGGTTGAGTC 10920 

GCGGGACCCC CGGTTCGAGT CTCGGACCGG CCGGACTGCG GCGAACGGGG GTTTGCCTCC 10980 
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CCGTCATGCA 


AGACCCCGCT 


TGCAAATTCC 


TPPGG A A A P 2X 


ACGAG CC 


CCTTTTTTGC 


11040 


TTTTCCCAGA 


TGCATCCGGT 


GCTGCGGCAG 




t-xCCTCAGCA 


GCGGCAAGAG 


11100 


CAAGAGCAGC 


GGCAGACATG 


CAGGGCACCr 


TPPPPTPPTP 


^^^^^^^ 
I- X ACCGCGTC 


AGGAGGGGCG 


11160 


ACATCCGCGG 


TTGACGCGGC 


AGCAGATGGT 


/^ATTapna ap 


CCCCGCGGCG 


CCGGGCCCGG 


11220 


CACTACCTGG 


ACTTGGAGGA 


GGGPGAGGnP 




TAGGAGCGCC 


CTCTCCTGAG 


11280 


CGGTACCCAA 


GGGTGCAGCT 






CGTACGTGCC 


GCGGCAGAAC 


11340 


CTGTTTCGCG 


ACCGC6AGGG 


AGAGGAGPPP 


p a <'2P a p a •p^o 


GGGATCGAAA 


GTTCCACGCA 


11400 


GGGCGCGAGC 


TGCGGCATGG 


CCTGAATPGP 


mv^PriPTTPP 


TGCGCGAGGA GGACTTTGAG 


11460 


CCCGACGCGC 


GAACCGGGAT 


TAGTCCCGPR 


prspfipa p a r*/^ 
v.. oL-^U AL. AUu 


TGGCGGCCGC 


CGACCTGGTA 


11520 


ACCGCATACG 


AGCAGACGGT 


GAACCAGGAG 


/iX XniiU X X XV. 


AAAAAAGCTT 


TAACAACCAC 


11580 


GTGCGTACGC 


TTGTGGCGCG 


CGAGGAGGTG 


^PT a a P/^ a 


TGATGCATCT 


GTGGGACTTT 


11640 


GTAAGCGCGC 


TGGAGCAAAA 


CCCAAATAGP 


AnnpppPTPa 


TGGCGCAGCT 


GTTCCTTATA 


11700 


GTGCAGCACA 


GCAGGGACAA 


CG AGGP A TTP 


2i r!/T!P a TP PP 1^ 


TGCTAAACAT 


AGTAGAGCCC 


11760 


GAGGGCCGCT 


GGCTGCTCGA 


TTTGATAAAP 


ATPPTPPar^ a 


GCATAGTGGT 


GCAGGAGCGC 


11820 


AGCTTGAGCC 


TGGCTGACAA 




ATpa a Pfa 
Ax V-AAt-TA T T 


CCATGCTTAG 


CCTGGGCAAG 


11680 


TTTTACGCCC 


GCAAGATATA 


PP ATa PPPPT 


1 ACexxCCC A 


TAGACAAGGA GGTAAAGATC 


11940 


GAGGGGTTCT 






GTGCTTACCT 


TGAGCGACGA 


CCTGGGCGTT 


12000 


TATCGCAACG 




P2i Zi^S/^PPnTP 


AvjCGTGAGCC 


GGCG6CGCGA 


GCTCAGCGAC 


12060 


CGCGAGCTGA 


TGCACAGCPT 




LTGGCTGGCA 


CGGGCAGCGG 


CGATAGAGAG 


12120 


GCCGAGTCCT 






CTGCGCTGGG 


CCCCAAGCCG 


ACGCGCCCTG 


12180 


GAGGCAGCTG 


GGGCCGGACP 


TGG/^PTPPPP 


(J i (ivjCACCCG 


CGCGCGCTGG 


CAACGTCGGC 


12240 


GGCGTGGAGG 


AATATGAPGa 


GG2i Pf2 a TP a p 


TACGAGCCAG 


AGGACGGCGA 


GTACTAAGCG 


12300 


GTGATGTTTC 


TGATCAGATG 




CAACGGACCC 


GGCGGTGCGG 


GCGGCGCTGC 


12360 


AGAGCCAGCC 


GTCCGGCCTT 


AAPTPPaPfSn 


AuuACTGGCG 


CCAGGTCATG 


GACCGCATCA 


12420 


TGTCGCTGAC 


TGCGCGCAAT 


p PTG 2i r*cir*n t 


1 L.C6GCAGCA 


GCCGCAGGCC 


AACCGGCTCT 


12480 


CCGCAATTCT 


GGAAGCGGTG 


GTPPPGGPnp 


rtpppa a a r*r^^ 
uk.ijL.AAALUv. 


CACGCACGAG 


AAGGTGCTGG 


12540 


CGATCGTAAA 


CGCGCTGGCC 


G AA & A P 2V f^rin 


ppaT^r'^/^^^ 
uuA i uuuGCC 


CGACGAGGCC 


GGCCTGGTCT 


12600 


ACGACGCGCT 


GCTTCAGCGC 


GTGGPTPGTT 


A Pa apap/^r'/^ 


CAACGTGCAG 


ACCAACCTGG 


12660 


ACCGGCTGGT 


GGGGGATGTG 




1 VjUUbuAGCG 


TGAGCGCGCG 


CAGCAGCAGG 


12720 


GCAACCTGGG 


CTCCATGGTT 


fip 21 PT n 2i a pn 


u UTTCCTGAG 


TACACAGCCC 


GCCAACGTGC 


12780 


CGCGGGGACA 


GGAGGACTAr 


APPittPTTT'^^ 


*Pff appp/'*a ^>T< 


GCGGCTAATG 


GTGACTGAGA 


12840 




TGAGGTGTAC 


CAGTCTGGGC 


CAGACTATTT 


TTTCCAGACC 


AGTAGACAAG 


12900 


GCCTGCAGAC 


CGTAAACCTG 


AGCCAGGCTT 


TCAAAAACTT 


GCAGGGGCTG 


TGGGGGGTGC 


12960 


GGGCTCCCAC 


AGGCGACCGC 


GCGACCGTGT 


CTAGCTTGCT 


GACGCCCAAC 


TCGCGCCTGT 


13020 


TGCTGCTGCT 


AATAGCGCCC 


TTCACGGACA 


GTGGCAGCGT 


GTCCCGGGAC 


ACATACCTAG 


13080 
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GTCACTTGCT GACACTGTAC CGCGAGGCCA TAGGTCAGGC GCATGTGGAC GAGCATACTT 13140 
TCCAGGAGAT TACAAGTGTC AGCCGCGCGC TGGGGCAGGA GGACACGGGC AGCCTGGAGG 13200 
CAACCCTAAA CTACCTGCTG ACCAACCGGC GGCAGAAGAT CCCCTCGTTG CACAGTTTAA 13260 
ACAGCGAGGA GGAGCGCATT TTGCGCTACG TGCAGCAGAG CGTGAGCCTT AACCTGATGC 13320 
GCGACGGGGT AACGCCCAGC GTGGCGCTGG ACATGACCGC GCGCAACATG GAACCGGGCA 
TGTATGCCTC AAACCGGCCG TTTATCAACC GCCTAATGGA CTACTTGCAT CGCGCGGCCG 
CCGTGAACCC CGAGTATTTC ACCAATGCCA TCTTGAACCC GCACTGGCTA CCGCCCCCTG 
GTTTCTACAC CGGGGGATTC GAGGTGCCCG AGGGTAACGA TGGATTCCTC TGGGACGACA 
TAGACGACAG CGTGTTTTCC CCGCAACCGC AGACCCTGCT AGAGTTGCAA CAGCGCGAGC 
AGGCAGAGGC GGCGCTGCGA AAGGAAAGCT TCCGCAGGCC AAGCAGCTTG TCCGATCTAG 
GCGCTGCGGC CCCGCGGTCA GATGCTAGTA GCCCATTTCC AAGCTTGATA GGGTCTCTTA 
CCAGCACTCG CACCACCCGC CCGCGCCTGC TGGGCGAGGA GGAGTACCTA AACAACTCGC 
TGCTGCAGCC GCAGCGCGAA AAAAACCTGC CTCCGGCATT TCCCAACAAC GGGATAGAGA 
GCCTAGTGGA CAAGATGAGT AGATGGAAGA CGTACGCGCA GGAGCACAGG GACGTGCCAG 
GCCCGCGCCC GCCCACCCGT CGTCAAAGGC ACGACCGTCA GCGGGGTCTG GTGTGGGAGG 
ACGATGACTC GGCAGACGAC AGCAGCGTCC TGGATTTGGG AGGGAGTGGC AACCCGTTTG 
CGCACCTTCG CCCCAGGCTG GGGAGAATGT TTTAAAAAAA AAAAAGCATG ATGCAAAATA 
AAAAACTCAC CAAGGCCATG GCACCGAGCG TTGGTTTTCT TGTATTCCCC TTAGTATGCG 
GCGCGCGGCG ATGTATGAGG AAGGTCCTCC TCCCTCCTAC GAGAGTGTGG TGAGCGCGGC 
GCCAGTGGCG GCGGCGCTGG GTTCTCCCTT CGATGCTCCC CTGGACCCGC CGTTTGTGCC 
TCCGCGGTAC CTGCGGCCTA CCGGGGGGAG AAACAGCATC CGTTACTCTG AGTTGGCACC 
CCTATTCGAC ACCACCCGTG TGTACCTGGT GGACAACAAG TCAACGGATG TGGCATCCCT 
GAACTACCAG AACGACCACA GCAACTTTCT GACCACGGTC ATTCAAAACA ATGACTACAG 
CCCGGGGGAG GCAAGCACAC AGACCATCAA TCTTGACGAC CGGTCGCACT GGGGCGGCGA 
CCTGAAAACC ATCCTGCATA CCAACATGCC AAATGTGAAC GAGTTCATGT TTACCAATAA 
GTTTAAGGCG CGGGTGATGG TGTCGCGCTT GCCTACTAAG GACAATCAGG TGGAGCTGAA 
ATACGAGTGG GTGGAGTTCA CGCTGCCCGA GGGCAACTAC TCCGAGACCA TGACCATAGA 
CCTTATGAAC AACGCGATCG TGGAGCACTA CTTGAAAGTG GGCAGACAGA ACGGGGTTCT 
GGAAAGCGAC ATCGGGGTAA AGTTTGACAC CCGCAACTTC AGACTGGGGT TTGACCCCGT 
CACTGGTCTT GTCATGCCTG GGGTATATAC AAACGAAGCC TTCCATCCAG ACATCATTTT 
GCTGCCAGGA TGCGGGGTGG ACTTCACCCA CAGCCGCCTG AGCAACTTGT TGGGCATCCG 
CAAGCGGCAA CCCTTCCAGG AGGGCTTTAG GATCACCTAC GATGATCTGG AGGGTGGTAA 
CATTCCCGCA CTGTTGGATG TGGACGCCTA CCAGGCGAGC TTGAAAGATG ACACCGAACA 
GGGCGGGGGT GGCGCAGGCG GCAGCAACAG CAGTGGCAGC GGCGCGGAAG AGAACTCCAA 
CGCGGCAGCC GCGGCAATGC AGCCGGTGGA GGACATGAAC GATCATGCCA TTCGCGGCGA 
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14220 
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14400 
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CACCTTTCSPr* 


A^ aUVjooV. i Vj 


AOV3AGAAGCG 


CGCTGAGGCC 


GAAGCAGCGG 


PCT/US97/19541 

CCGAAGCTGC 15240 






AvjiCs rCGAGAA 


GCCTCAGAAG 


AAACCGGTGA 


TCAAACCCCT 


15300 




aLj L. aaOaaaLi 


^1 A I'liin A ^ TV TV 

vat-AGTTACAA 


CCTAATAAGC 


AATGACAGCA 


CCTTCACCCA 


15360 


GTAC CGP AG r 


X X X X O 


L. AU. AwAACTA 


CGGCGACCCT 


CAGACCGGAA 


TCCGCTCATG 


15420 


GACCCTGCTT 


Tnp & PTPPTrs 


ACu X AAL. C TG 


CGGCTCGGAG 


CAGGTCTACT 


GGTCGTTGCC 


15480 


AGACATGATf; 




TGACCTTCCG 


CTCCACGCGC 


CAGATCAGCA 


ACTTTCCGGT 


15540 


wv9 X w>Jv3 wO ^ 


^aVjL. X i X VsL. 


r^t*\t ••11/ ^im^i 

CCGTGCACTC 


CAAGAGCTTC 


TACAACGACC 


AGGCCGTCTA 


15600 


w X V# w W X w 


Zi TPPPPP P T» 


rXACCTCTCT 


GACCCACGTG 


TTCAATCGCT 


TTCCCGAGAA 


15660 


W \a« Awn X X X X V3 


u u u U V, vj L. 


CAGCCCCCAC 


CATCACCACC 


GTCAGTGAAA 


ACGTTCCTGC 


15720 


X v.. X VmA^AUA Jk 




TACCGCTGCG 


CAACAGCATC 


GGAGGAGTCC 


AGCGAGTGAC 


15780 


V-AX XA\« XOAVv 




vjCACCTGCCC 


CTACGTTTAC 


AAGGCCCTGG 


GCATAGTCTC 


15840 


GCCGCGCGTr 


PTZiTPP JV/SPP 
Vi» X aX wV^aOVoW 


PO 71 »» twill T>/^ 

Gv-ACTTTTTG 


AGCAAGCATG 


TCCATCCTTA 


TATCGCCCAG 


15900 


V— AA X AAw ALa.A 




^^^^ ^^^^ ^v^nnv^*t^^^*t 

TGCGCTTCCC 


AAGCAAGATG 


TTTGGCGGGG 


CCAAGAAGCG 


15960 


X V«wVrA\«V>AA 


^A^UuAu X vjC 


GCGTGCGCGG 


GCACTACCGC 


GCGCCCTGGG 


GCGCGCACAA 


16020 






CCACCGTCGA 


TGACGCCATC 


GACGCGGTGG 


TGGAGGAG6C 


16080 


w\«V3^Aa^ X a^ 




CGCCACCAGT 


GTCCACAGTG 


GACGCGGCCA 


TTCAGACCGT 


16140 




GCCCGGCGCT 


ATGCTAAAAT 


GAAGAGACGG 


CGGAGGCGCG 


TAGCACGTCG 


16200 




u(jAL.C. CGGCA 


CTGCCGCCCA 


ACGCGCGGCG 


GCGGCCCTGC 


TTAACCGCGC 


16260 


AW>V7 X \.k3WA^»^ 




CGGCCATGCG 


GGCCGCTCGA 


AGGCTGGCCG 


CGGGTATTGT 


16320 


PAPTdTrSPPP 

wAw 1 V9 X\J^l.^ 




GGCGACGAGC 


GGCCGCCGCA 


GCAGCCGCGG 


CCATTAGTGC 


16380 


1a X VlAli_ 1 \-J\\3 


GGTCGCAGGG 


GCAACGTGTA 


TTGGGTGCGC 


GACTCGGTTA 


GCGGCCTGCG 


16440 


PfSTCPPPnTr' 


v.\j(.AwCCGCC 


CCCCGCGCAA 


CTAGATTGCA 


AGAAAAAACT 


ACTTAGACTC 


16500 


V3 X aV» 1 o 1 1 V3 1 


A X Vj i ATCC AG 


CGGCGGCGGC 


GCGCAACGAA 


GCTATGTCCA AGCGCAAAAT 


16560 


n>aaauaaVj/\Vj 


A X VjCTCCAGG 


TCATCGCGCC 


GGAGATCTAT 


GGCCCCCCGA AGAAGGAAGA 


16620 


^U.Aooa1 xi^L. 


AAGCCCCGAA 


AGCTAAAGCG 


GGTCAAAAAG 


AAAAAGAAAG 


ATGATGATGA 


16680 


X VJAAXw X 1 AV» 




AACTGCTGCA 


CGCTACCGCG 


CCCAGGCGAC 


GGGTACAGTG 


16740 


UAAAuva 1 


UGCGTAAAAC 


GTGTTTTGCG 


ACCCGGCACC 


ACCGTAGTCT 


TTACGCCCGG 


16800 


TG AG racTcr' 




ACAAGCGCGT 


GTATGATGAG 


GTGTACGGCG 


ACGAGGACCT 


16860 




r^f> TV * TV 

^^CCAACGAGC 


GCCTCGGGGA 


GTTTGCCTAC 


GGAAAGCGGC 


ATAAGGACAT 


16920 




L.uvav.TGGACG 


^ ^^^^^^^^^ ^v^m^v 

AGGGCAACCC 


AACACCTAQC 


CTAAAGCCCG 


TAACACTGCA 


16980 




C.CCGCGCTTG 


CACCGTCCGA 


AGAAAAGCGC 


GGCCTAAAGC 


GCGAGTCTGG 


^ ^ f\A r\ 


X\3A^ X XV70^A 


L. L. C ACvJGTGC 


AGCTGATGGT 


ACCCAAGCGC 


CAGCGACTGG 


AAGATGTCTT 


17100 


GGAAAAAATG 


ACCGTGGAAC 


CTGGGCTGGA 


GCCCGAGGTC 


CGCGTGCGGC 


CAATCAAGCA 


17160 


GGTGGCGCCG 


GGACTGGGCG 


TGCAGACCGT 


GGACGTTCAG 


ATACCCACTA 


CCAGTAGCAC 


17220 


CAGTATTGCC 


ACCGCCACAG 


AGGGCATGGA 


GACACAAACG 


TCCCCGGTTG 


CCTCAGCGGT 


17280 
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GGCGGATGCC GCGGTGCAGG CGGTCGCTGC GGCCGCGTCC AAGACCTCTA C6GAGGTGCA 17340 

AACGGACCCG TGGATGTTTC GCGTTTCAGC CCCCCGGCGC CCGCGCGGTT CGAGGAAGTA 17400 

CGGCGCCGCC AGCGCGCTAC TGCCCGAATA TGCCCTACAT CCTTCCATTG CGCCTACCCC 17460 

CGGCTATCGT GGCTACACCT ACCGCCCCAG AAGACGAGCA ACTACCCGAC GCCGAACCAC 17520 

CACTGGAACC CGGCGCCGCC GTCGCCGTCG CCAGCCCGTG CTGGCCCCGA TTTCCGTGCG 17580 

CAGGGTGGCT CGCGAAGGAG GCAGGACCCT GGTGCTGCCA ACAGCGCGCT ACCACCCCAG 17640 

CATCGTTTAA AAGCCGGTCT TTGTGGTTCT TGCAGATATG GCCCTCACCT GCCGCCTCCG 17700 

TTTCCCGGTG CCGGGATTCC GAGGAAGAAT GCACCGTAGG AGGGGCATGG CCGGCCACGG 17760 

CCTGACGGGC GGCATGCGTC GTGCGCACCA CCGGCGGCGG CGCGCGTCGC ACCGTCGCAT 17820 

GCGCGGCGGT ATCCTGCCCC TCCTTATTCC ACTQATCGCC GCGGCGATTG GCGCCGTGCC 17880 

CGGAATTGCA TCCGTGGCCT TGCAGGCGCA GAGACACTGA TTAAAAACAA GTTGCATGTG 17940 

GAAAAATCAA AATAAAAAGT CTGGACTCTC ACGCTCGCTT GGTCCTGTAA CTATTTTGTA 18000 

GAATGGAAGA CATCAACTTT GCGTCTCTGG CCCCGCGACA CGGCTCGCGC CCGTTCATGG 18060 

GAAACTGGCA AGATATCGGC ACCAGCAATA TGAGCGGTGG CGCCTTCAGC TGGGGCTCGC 18120 

TGTGGAGCGG CATTAAAAAT TTCGGTTCCA CCGTTAAGAA CTATGGCAGC AAGGCCTGGA 18180 

ACAGCAGCAC AGGCCAGATG CTGAGGGATA AGTTGAAAGA GCAAAATTTC CAACAAAAGG 18240 

TGGTAGATGG CCTGGCCTCT GGCATTAGCG GGGTGGTGGA CCTGGCCAAC CAGGCAGTGC 18300 

AAAATAAGAT TAACAGTAAG CTTGATCCCC GCCCTCCCGT AGAGGAGCCT CCACCGGCCG 18360 

TGGAGACAGT GTCTCCAGAG GGGCGTGGCG AAAAGCGTCC GCGCCCCGAC AGGGAAGAAA 18420 

CTCTGGTGAC GCAAATAGAC GAGCCTCCCT CGTACGAGGA GGCACTAAAG CAAGGCCTGC 18480 

CCACCACCCG TCCCATCGCG CCCATGGCTA CCGGAGTGCT GGGCCAGCAC ACACCCGTAA 18540 

CGCTGGACCT GCCTCCCCCC GCCGACACCC AGCAGAAACC TGTGCTGCCA GGCCCGACCG 18600 

CCGTTGTTGT AACCCGTCCT AGCCGCGCGT CCCTGCGCCG CGCCGCCAGC GGTCCGCGAT 18660 

CGTTGCGGCC CGTAGCCAGT GGCAACTGGC AAAGCACACT GAACAGCATC GTGGGTCTGG 18720 

GGGTGCAATC CCTGAAGCGC CGACGATGCT TCTGAATAGC TAACGTGTCG TATGTGTGTC 18780 

ATGTATGCGT CCATGTCGCC GCCAGAGGAG CTGCTGAGCC GCCGCGCGCC CGCTTTCCAA 18840 

GATGGCTACC CCTTCGATGA TGCCGCAGTG GTCTTACATG CACATCTCGG GCCAGGACGC 18900 

CTCGGAGTAC CTGAGCCCCG GGCTGGTGCA GTTTGCCCGC GCCACCGAGA CGTACTTCAG 18960 

CCTGAATAAC AAGTTTAGAA ACCCCACGGT GGCGCCTACG CACGACGTGA CCACAGACCG 19020 

GTCCCAGCGT TTGACGCTGC GGTTCATCCC TGTGGACCGT GAGGATACTG CGTACTCGTA 19080 

CAAGGCGCGG TTCACCCTAG CTGTGGGTGA TAACCGTGTG CTGGACATGG CTTCCACGTA 19140 

CTTTGACATC CGCGGCGTGC TGGACAGGGG CCCTACTTTT AAGCCCTACT CTGGCACTGC 19200 

CTACAACGCC CTGGCTCCCA AGGGTGCCCC AAATCCTTGC GAATGGGATG AAGCTGCTAC 19260 

TGCTCTTGAA ATAAACCTAG AAGAAGAGGA CGATGACAAC GAAGACGAAG TAGACGAGCA 19320 

AGCTGAGCAG CAAAAAACTC ACGTATTTGG GCAGGCGCCT TATTCTGGTA TAAATATTAC 19380 
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AAAGGAGGGT 


ATTCAAATAG 


GTGTPGAAGG 


Tpaaapap/^T 

X ^AAA\«A\»W 1 


AAA T A 1*0 000 

AAATATGCCG 


ATAAAACATT 


19440 


TCAACCTGAA 


CCTCAAATAG 


GAGAATPTPA 


GTGGT a pn a a 

u X OVj X A^uAA 


A OT^O AAA 'llf l> A 

AC TG AAATTA 


ATCATGCAGC 


19500 


TGGGAGAGTC 


CTTAAAAAGA 


PTAPPPPAAT 


rsaaappaTPT 

urAAACL.A 1 X 


•p A /^/^OT'^P/^ A 1* 

TACGGTTCAT 


ATGCAAAACC 


19560 


CACAAATGAA 


AATGGAGGGC 


AAGGCATTPT 


TGTZVAAGPaa 


i^aaAATOOA A 

V- AAAA X (joAA 


AGCTAGAAAG 


19620 


TCAAGTGGAA 


ATG CAATTTT 

• •XwWw^X XXX 


TCTC AAPTA P 


TGAGGPGaPP 


0 0 A 0 0 0 A T\ rp/~» 

VjCAVjGCAATG 


GTGATAACTT 


19680 


GACTCCTAAA 


GTGGTATTGT 

>0 X W\S X X X \9 X 


APAGTGAAGA 


TGTa^2aTaTa 

xvalAljAxAXA 


A A A ^f^^/^ A 0 

(jAAACC C C AG 


ACACTCATAT 


19740 


TTCTTACATG 


CCCACTATTA 


AGG AAGGT A A 


PTPapGapaa 

^ 1 ^ A^uAuAA 


OT* A A*POOOOO 

CTAATGGGCC 


AACAATCTAT 


19800 


GCCCAACAGG 


CCTAATTACA 

xnj^x x^%^^% 


TTG PTTTT A G 
X x\av« XXX xavj 


^^apa a*i*'i"i"T' 

OVaA^AAX ill 


A X TGGTCTAA 


TGTATTACAA 


19860 


CAGCACGGGT 


AATATGCsGTG 


TTPTGGPGGG 


PP a a np a Tr»r» 
^CAAvjCA X Cu 


0 A f '<f 1 i>no A TV mo 

CAGTTGAATG 


CTGTTGTAGA 


19920 


TTTGCAAGAC 


AGAAACACAG 


AGPTTTPaTa 

XXX V»A Xi\ 


CCAk^CX X X XVji 


OTWPO A f llf TIO 0 A 

CTTGATTCCA 


TTGGTGATAG 


19980 


AACCAGGTAC 


TTTTf^JXTf-T 




Xvs X xvjACAGC 


TATGATCCAG 


ATGTTAGAAT 


20040 


TATTGAAAAT 


CATGGAIVPTG 


a aGa Tr^a a ni* 


♦Pf^^A A ATT* A 

X CCAAATTAC 


TGCTTTC C AC 


TGGGAGGTGT 


20100 


GATTAATACA 




^ L.ii>iLj(j X AAA 


A A A A TV A 

ACCTAAAACA 


GGTCAGGAAA 


ATGGATGGGA 


20160 


AAAAG ATG CT 


Af AGAATTTT 


r^aciaTa a a a a 


TUAAATAAGA 


GTTGGAAATA 


ATTTTGCCAT 


20220 


GGAAATCAAT 






AAATTTCCTG 


TACTCCAACA 


TAGCGCTGTA 


20280 


TTTGCCCGAC 


A AGPT& 2V Am* 




OA AO^fnA AAA 

CAACGTAAAA 


ATTTCTGATA 


ACCCAAACAC 


20340 


CTACGArTAP 


2VTG2i zv p a a/nr* 




TCCCGGGTTA 


GTGGACTGCT 


ACATTAACCT 


20400 


TGGAGCACG P 


TGGT PP PTTP 
X WW X ^\«V» 1 X o 


AL. i AlAlvjoA 


CAACGTCAAC 


CCATTTAACC 


ACCACCGCAA 


20460 






L. AA 1 V9 X TCaC T 


GGGCAATGGT 


CGCTATGTGC 


CCTTCCACAT 


20520 


C C AGGTG r CT 




X lvjv.(.Al XAA 


AAACCTCCTT 


CTCCTGCCGG 


GCTCATACAC 


20580 


CTACGAGTGG 


AAPTTPAGna 


AVj^jAIox lAA 


CATGGTTCTG 


CAGAGCTCCC 


TAGGAAATGA 


20640 


CCTAAGGGTT 


GAPGGAGPPa 


j^p a TT a a r'TT 


X uATAGCATT 


TGCCTTTACG 


CCACCTTCTT 


20700 


CCCCATGGCP 


PAPHPAPPG 


p r"T*r^r^a ot 
Uk. 1 ^CAVvVjU i 


TGAGGCCATG 


CTTAGAAACG 


ACACCAACGA 


20760 


CCAGTCCTTT 


AAPGAPTaTP 




OA A 0 A 0 

CAACATGCTC 


TACCCTATAC 


CCGCCAACGC 


20820 




pppaTaTr^r'a 




CAACTGGGCG 


GCTTTCCGCG 


GCTGGGCCTT 


20880 




AAfsapTaa/^/^ 


AAAL. L U CATC 


ACTGGGCTCG 


GGCTACGACC 


CTTATTACAC 


20940 


V— X X ^ X v» 


Tr^T* a T* a /^r'/^'p 

1 W liiX AL.\.d 




AACCTTTTAC 


CTCAACCACA 


CCTTTAAGAA 


21000 


GGTGGrrATT 


A PP*T*T"T'^ a r^'P 


C X 1 C X v,3 X L. AVj 


CTGGCCTGGC 


AATGACCGCC 


TGCTTACCCC 


21060 


CAACGAGTTT 




VjC X CAvj X X. (jA 


00000 A 00 O*^ 

CGGGGAGGGT 


TACAACGTTG 


CCCAGTGTAA 


21120 


CATGACCAAA 


X \JU X X V»Vm 


Tra^Tapa a aT 
xurulAVAAAX 


OOfAOOfA A 0 

bCTAGCTAAC 


TAcAACATTG 


GCTACCAGGG 


21180 


CTTCTATATC 

W X X %v X f% X#% X ^» 


PPAGanaGPT 


AC AAViVj AC CV,> 


0 A T'OT' A /'ilflOO 

CATGTACTCC 


TTCTTTAGAA 


ACTTCCAGCC 


21240 


CATGAGCCGT 


CAGGTG6TGG 


ATGATACTAA 


ATACAAGGAC 


TACCAACAGG 


TGGGCATCCT 


21300 


ACACCAACAC 


AACAACTCTG 


GATTTGTTGG 


CTACCTTGCC 


CCCACCATGC 


GCGAAGGACA 


21360 


GGCCTACCCT 


GCTAACTTCC 


CCTATCCGCT 


TATAGGCAAG 


ACCGCAGTTG 


ACAGCATTAC 


21420 


CCAGAAAAAG 


TTTCTTTGCG 


ATCGCACCCT 


TTGGCGCATC 


CCATTCTCCA 


GTAACTTTAT 


21480 
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GTCCATGGGC GCACTCACAG ACCTGGGCCA AAACCTTCTC TACGCCAACT CCGCCCACGC 21540 
GCTAGACATG ACTTTTGAGG TGGATCCCAT GGACGAGCCC ACCCTTCTTT ATGTTTTGTT 21600 



TGAAGTCTTT GACGTGGTCC GTGTGCACCG GCCGCACCGC GGCGTCATCG AAACCGTGTA 
CCTGCGCACG CCCTTCTCGG CCGGCAACGC CACAACATAA AGAAGCAAGC AACATCAACA 
ACAGCTGCCG CCATGGGCTC CAGTGAGCAG GAACTGAAAG CCATTGTCAA AGATCTTGGT 
TGTGGGCCAT ATTTTTTGGG CACCTATGAC AAGCGCTTTC CAGGCTTTGT TTCTCCACAC 
AAGCTCGCCT GCGCCATAGT CAATACGGCC GGTCGCGAGA CTGGGGGCGT ACACTGGATG 
GCCTTTGCCT GGAACCCGCA CTCAAAAACA TGCTACCTCT TTGAGCCCTT TGGCTTTTCT 
GACCAGCGAC TCAAGCAGGT TTACCAGTTT GAGTACGAGT CACTCCTGCG CCGTAGCGCC 
ATTGCTTCTT CCCCCGACCG CTGTATAACG CTGGAAAAGT CCACCCAAAG CGTACAGGGG 
CCCAACTCGG CCGCCTGTGG ACTATTCTGC TGCATGTTTC TCCACGCCTT TGCCAACTGG 
CCCCAAACTC CCATGGATCA CAACCCCACC ATGAACCTTA TTACCGGGGT ACCCAACTCC 
ATGCTCAACA GTCCCCAGGT ACAGCCCACC CTGCGTCGCA ACCAGGAACA GCTCTACAGC 
TTCCTGGAGC GCCACTCGCC CTACTTCCGC AGCCACAGTG CGCAGATTAG GAGCGCCACT 
TCTTTTTGTC ACTTGAAAAA CATGTAAAAA TAATGTACTA GAGACACTTT CAATAAAGGC 
AAATGCTTTT ATTTGTACAC TCTCGGGTGA TTATTTACCC CCACCCTTGC CGTCTGCGCC 
GTTTAAAAAT CAAAGGGGTT CTGCCGCGCA TCGCTATGCG CCACTGGCAG GGACACGTTG 
CGATACTGGT GTTTAGTGCT CCACTTAAAC TCAGGCACAA CCATCCGCGG CAGCTCGGTG 
AAGTTTTCAC TCCACAGGCT GCGCACCATC ACCAACGCGT TTAGCAGGTC GGGCGCCGAT 
ATCTTGAAGT CGCAGTTGGG GCCTCCGCCC TGCGCGCGCG AGTTGCGATA CACAGGGTTG 
CAGCACTGGA ACACTATCAG CGCCGGGTGG TGCACGCTGG CCAGCACGCT CTTGTCGGAG 
ATCAGATCCG CGTCCAGGTC CTCCGCGTTG CTCAGGGCGA ACGGAGTCAA CTTTGGTAGC 
TGCCTTCCCA AAAAGGGCGC GTGCCCAGGC TTTGAGTTGC ACTCGCACCG TAGTGGCATC 
AAAAGGTGAC CGTGCCCGGT CTGGGCGTTA GGATACAGCG CCTGCATAAA AGCCTTGATC 
TGCTTAAAAG CCACCTGAGC CTTTGCGCCT TCAGAGAAGA ACATGCCGCA AGACTTGCCG 
GAAAACTGAT TGGCCGGACA GGCCGCGTCG TGCACGCAGC ACCTTGCGTC GGTGTTGGAG 
ATCTGCACCA CATTTCGGCC CCACCGGTTC TTCACGATCT TGGCCTTGCT AGACTGCTCC 
TTCAGCGCGC GCTGCCCGTT TTCGCTCGTC ACATCCATTT CAATCACGTG CTCCTTATTT 
ATCATAATGC TTCCGTGTAG ACACTTAAGC TCGCCTTCGA TCTCAGCGCA GCGGTGCAGC 
CACAACGCGC AGCCCGTGGG CTCGTGATGC TTGTAGGTCA CCTCTGCAAA CGACTGCAGG 
TACGCCTGCA GGAATCGCCC CATCATCGTC ACAAAGGTCT TGTTGCTGGT GAAGGTCAGC 
TGCAACCCGC GGTGCTCCTC GTTCAGCCAG GTCTTGCATA CGGCCGCCAG AGCTTCCACT 
TGGTCAGGCA GTAGTTTGAA GTTCGCCTTT AGATCGTTAT CCACGTGGTA CTTGTCCATC 
AGCGCGCGCG CAGCCTCCAT GCCCTTCTCC CAC6CAGACA CGATCGGCAC ACTCAGCGGG 
TTCATCACCG TAATTTCACT TTCCGCTTCG CTGGGCTCTT CCTCTTCCTC TTGCGTCCGC 
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CTGCAGCTAG CCAACTACCT TGCCTACCAC TCTGACATAA TGGAAGACGT GAGCGGTGAC 25740 
GGTCTACTGG AGTGTCACTG TCGCTGCAAC CTATGCACCC CGCACCGCTC CCTGGTTTGC 25800 
AATTCGCAGC TGCTTAACGA AAGTCAAATT ATCGGTACCT TTGAGCTGCA GGGTCCCTCG 25860 
CCTGACGAAA AGTCCGCGGC TCCGGGGTTG AAACTCACTC CGGGGCTGTG GACGTCGGCT 25920 
TACCTTCGCA AATTTGTACC TGAGGACTAC CACGCCCACG AGATTAGGTT CTACGAAGAC 25980 
CAATCCCGCC CGCCAAATGC GGAGCTTACC GCCTGCGTCA TTACCCAGGG CCACATTCTT 
GGCCAATTGC AAGCCATCAA CAAAGCCCGC CAAGAGTTTC TGCTACGAAA GGGACGGGGG 
GTTTACTTGG ACCCCCAGTC CGGCGAGGAG CTCAACCCAA TCCCCCCGCC GCCGCAGCCC 
TATCAGCAGC AGCCGCGGGC CCTTGCTTCC CAGGATGGCA CCCAAAAAGA AGCTGCAGCT 
GCCGCCGCCA CCCACGGACG AGGAGGAATA CTGGGACAGT CAGGCAGAGG AGGTTTTGGA 
CGAGGAGGAG GAGGACATGA TGGAAGACTG GGAGAGCCTA GACGAGGAAG CTTCCGAGGT 
CGAAGAGGTG TCAGACGAAA CACCGTCACC CTCGGTCGCA TTCCCCTCGC CGGCGCCCCA 
GAAATCGGCA ACCGGTTCCA GCATGGCTAC AACCTCCGCT CCTCAGGCGC CGCCGGCACT 
GCCCGTTCGC CGACCCAACC GTAGATGGGA CACCACTGGA ACCAGGGCCG GTAAGTCCAA 
GCAGCCGCCG CCGTTAGCCC AAGAGCAACA ACAGCGCCAA GGCTACCGCT CATGGCGCGG 
GCACAAGAAC GCCATAGTTG CTTGCTTGCA AGACTGTGGG GGCAACATCT CCTTCGCCCG 
CCGCTTTCTT CTCTACCATC ACGGCGTGGC CTTCCCCCGT AACATCCTGC ATTACTACCG 
TCATCTCTAC AGCCCATACT GCACCGGCGG CAGCGGCAGC GGCAGCAACA GCAGCGGCCA 
CACAGAAGCA AAGGCGACCG GATAGCAAGA CTCTGACAAA GCCCAAGAAA TCCACAGCGG 
CGGCAGCAGC AGGAGGAGGA GCGCTGCGTC TGGCGCCCAA CGAACCCGTA TCGACCCGCG 
AGCTTAGAAA CAGGATTTTT CCCACTCTGT ATGCTATATT TCAACAGAGC AGGGGCCAAG 
AACAAGAGCT GAAAATAAAA AACAGGTCTC TGCGATCCCT CACCCGCAGC TGCCTGTATC 
ACAAAAGCGA AGATCAGCTT CGGCGCACGC TGGAAGACGC GGAGGCTCTC TTCAGTAAAT 
ACTGCGCGCT GACTCTTAAG GACTAGTTTC GCGCCCTTTC TCAAATTTAA GCGCGAAAAC 
TACGTCATCT GCAGCGGCCA CACCCGGCGC CAGCACCTGT CGTCAGCGCC ATTATGAGCA 
AGGAAATTCC CACGCCCTAC ATGTGGAGTT ACCAGCCACA AATGGGACTT GCGGCTGGAG 
CTGCCCAAGA CTACTCAACC CGAATAAACT ACATGAGCGC GGGACCCCAC ATGATATCCC 
GGGTCAACGG AATCCGCGCC CACCGAAACC GAATTCTCTT GGAACAGGCG GCTATTACCA 
CCACACCTCG TAATAACCTT AATCCCCGTA GTTGGCCCGC TGCCCTGGTG TACCAGGAAA 
GTCCCGCTCC CACCACTGTG GTACTTCCCA GAGACGCCCA GGCCGAAGTT CAGATGACTA 
ACTCAGGGGC GCAGCTTGCG GGCGGCTTTC GTCACAGGGT GCGGTCGCCC GGGCAGGGTA 
TAACTCACCT GACAATCAGA GGGCGAGGTA TTCAGCTCAA CGACGAGTCG GTGAGCTCCT 
CGCTTGGTCT CCGTCCGGAC GGGACATTTC AGATCGGCGG CGCCGGCCGT CCTTCATTCA 
CGCCTCGTCA GGCAATCCTA ACTCTGCAGA CCTCGTCCTC TGAGCCGCGC TCTGGAGGCA 
TTGGAACTCT GCAATTTATT GAGGAGTTTG TGCCATCGGT CTACTTTAAC CCCTTCTCGG 
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GACCTCCCGG CCACTATCCG GATCAATTTA TTCCTAACTT TGACGCGGTA AAGGACTCGG 27840 
CGGACGGCTA CGACTGAATG TTAAGTGGAG AGGCAGAGCA ACTGCGCCTG AAACACCTGG 27900 
TCCACTGTCG CCGCCACAAG TGCTTTGCCC GCGACTCCGG TGAGTTTTGC TACTTTGAAT 27960 
TGCCCGAGGA TCATATCGAG GGCCCGGCGC ACGGCGTCCG GCTTACCGCC CAGGGAGAGC 28020 
TTGCCCGTAG CCTGATTCGG GAGTTTACCC AGCGCCCCCT GCTAGTTGAG CGGGACAGGG 28080 
GACCCTG7GT TCTCACTGTG ATTTGCAACT GTCCTAACCT TGGATTACAT CAAGATCTTT 28140 
GTTGCCATCT CTGTGCTGAG TATAATAAAT ACAGAAATTA AAATATACTG GGGCTCCTAT 
CGCCATCCTG TAAACGCCAC CGTCTTCACC CGCCCAAGCA AACCAAGGCG AACCTTACCT 
GGTACTTTTA ACATCTCTCC CTCTGTGATT TACAACAGTT TCAACCCAGA CGGAGTGAGT 
CTACGAGAGA ACCTCTCCGA GCTCAGCTAC TCCATCAGAA AAAACACCAC CCTCCTTACC 
TGCCGGGAAC GTACGAGTGC GTCACCGGCC GCTGCACCAC ACCTACCGCC TGACCGTAAA 
CCAGACTTTT TCCGGACAGA CCTCAATAAC TCTGTTTACC AGAACAGGAG GTGAGCTTAG 
AAAACCCTTA GGGTATTAGG CCAAAGGCGC AGCTACTGTG GGGTTTATGA ACAATTCAAG 
CAACTCTACG GGCTATTCTA ATTCAGGTTT CTCTAGAATC GGGGTTGGGG TTATTCTCTG 
TCTTGTGATT CTCTTTATTC TTATACTAAC GCTTCTCTGC CTAAGGCTCG CCGCCTGCTG 
TGTGCACATT TGCATTTATT GTCAGCTTTT TAAACGCTGG GGTCGCCACC CAAGATGATT 
AGGTACATAA TCCTAGGTTT ACTCACCCTT GCGTCAGCCC ACGGTACCAC CCAAAAGGTG 
GATTTTAAGG AGCCAGCCTG TAATGTTACA TTCGCAGCTG AAGCTAATGA GTGCACCACT 
CTTATAAAAT GCACCACAGA ACATGAAAAG CTGCTTATTC GCCACAAAAA CAAAATTGGC 
AAGTATGCTG TTTATGCTAT TTGGCAGCCA GGTGACACTA CAGAGTATAA TGTTACAGTT 
TTCCAGGGTA AAAGTCATAA AACTTTTATG TATACTTTTC CATTTTATGA AATGTGCGAC 
ATTACCATGT ACATGAGCAA ACAGTATAAG TTGTGGCCCC CACAAAATTG TGTGGAAAAC 
ACTGGCACTT TCTGCTGCAC TGCTATGCTA ATTACAGTGC TCGCTTTGGT CTGTACCCTA 
CTCTATATTA AATACAAAAG CAGACGCAGC TTTATTGAGG AAAAGAAAAT GCCTTAATTT 
ACTAAGTTAC AAAGCTAATG TCACCACTAA CTGCTTTACT CGCTGCTTGC AAAACAAATT 
CAAAAAGTTA GCATTATAAT TAGAATAGGA TTTAAACCCC CCGGTCATTT CCTGCTCAAT 
ACCATTCCCC TGAACAATTG ACTCTATGTG GGATATGCTC CAGCGCTACA ACCTTGAAGT 
CAGGCTTCCT GGATGTCAGC ATCTGACTTT GGCCAGCACC TGTCCCGCGG ATTTGTTCCA 
GTCCAACTAC AGCGACCCAC CCTAACAGAG ATGACCAACA CAACCAACGC GGCCGCCGCT 
ACCGGACTTA CATCTACCAC AAATACACCC CAAGTTTCTG CCTTTGTCAA TAACTGGGAT 29580 
AACTTGG3CA TGTGGTGGTT CTCCATAGCG CTTATGTTTG TATGCCTTAT TATTATGTGG 29640 
CTCATCT3CT GCCTAAAGCG CAAACGCGCC CGACCACCCA TCTATAGTCC CATCATTGTG 29700 
CTACACCCAA ACAATGATGG AATCCATAGA TTGGACGGAC TGAAACACAT GTTCTTTTCT 29760 
CTTACAGTAT GATTAAATGA GACATGATTC CTCGAGTTTT TATATTACTG ACCCTTGTTG 29820 
CGCTTTTTTG TGCGTGCTCC ACATTGGCTG CGGTTTCTCA CATCGAAGTA GACTGCATTC 29880 
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CAGCCTTCAC AGTCTATTTG CTTTACGGAT TTGTCACCCT CACGCTCATC TGCAGCCTCA 29940 
TCACTGTGGT CATCGCCTTT ATCCAGTGCA TTGACTGGGT CTGTGTGCGC TTTGCATATC 30000 
TCAGACACCA TCCCCAGTAC AGGGACAGGA CTATAGCTGA GCTTCTTAGA ATTCTTTAAT 30060 
TATGAAATTT ACTGTGACTT TTCTGCTGAT TATTTGCACC CTATCTGCGT TTTGTTCCCC 30120 
GACCTCCAAG CCTCAAAGAC ATATATCATG CAGATTCACT CGTATATGGA ATATTCCAAG 
TTGCTACAAT GAAAAAAGCG ATCTTTCCGA AGCCTGGTTA TATGCAATCA TCTCTGTTAT 
GGTGTTCTGC AGTACCATCT TAGCCCTAGC TATATATCCC TACCTTGACA TTGGCTGGAA 
ACGAATAGAT GCCATGAACC ACCCAACTTT CCCCGCGCCC GCTATGCTTC CACTGCAACA 
AGTTGTTGCC GGCGGCTTTG TCCCAGCCAA TCAGCCTCGC CCCACTTCTC CCACCCCCAC 
TGAAATCAGC TACTTTAATC TAACAGGAGG AGATGACTGA CACCCTAGAT CTAGAAATGG 
ACGGAATTAT TACAGAGCAG CGCCTGCTAG AAAGACGCAG GGCAGCGGCC GAGCAACAGC 
GCATGAATCA AGAGCTCCAA GACATGGTTA ACTTGCACCA GTGCAAAAGG GGTATCTTTT 
GTCTGGTAAA GCAGGCCAAA GTCACCTACG ACAGTAATAC CACCGGACAC CGCCTTAGCT 
ACAAGTTGCC AACCAAGCGT CAGAAATTGG TGGTCATGGT GGGAGAAAAG CCCATTACCA 
TAACTCAGCA CTCGGTAGAA ACCGAAGGCT GCATTCACTC ACCTTGTCAA GGACCTGAGG 
ATCTCTGCAC CCTTATTAAG ACCCTGTGCG GTCTCAAAGA TCTTATTCCC TTTAACTAAT 
AAAAAAAAAT AATAAAGCAT CACTTACTTA AAATCAGTTA GCAAATTTCT GTCCAGTTTA 
TTCAGCAGCA CCTCCTTGCC CTCCTCCCAG CTCTGGTATT GCAGCTTCCT CCTGGCTGCA 
AACTTTCTCC ACAATCTAAA TGGAATGTCA GTTTCCTCCT GTTCCTGTCC ATCCGCACCC 
ACTATCTTCA TGTTGTTGCA GATGAAGCGC GCAAGACCGT CTGAAGATAC CTTCAACCCC 
GTGTATCCAT ATGACACGGA AACCGGTCCT CCAACTGTGC CTTTTCTTAC TCCTCCCTTT 
GTATCCCCCA ATGGGTTTCA AGAGAGTCCC CCTGGGGTAC TCTCTTTGCG CCTATCCGAA 
CCTCTAGTTA CCTCCAATGG CATGCTTGCG CTCAAAATGG GCAACGGCCT CTCTCTGGAC 
GAGGCCGGCA ACCTTACCTC CCAAAATGTA ACCACTGTGA GCCCACCTCT CAAAAAAACC 
AAGTCAAACA TAAACCTGGA AATATCTGCA CCCCTCACAG TTACCTCAGA AGCCCTAACT 
GTGGCTGCCG CCGCACCTCT AATGGTCGCG GGCAACACAC TCACCATGCA ATCACAGGCC 
CCGCTAACCG TGCACGACTC CAAACTTAGC ATTGCCACCC AAGGACCCCT CACAGTGTCA 
GAAGGAAAGC TAGCCCTGCA AACATCAGGC CCCCTCACCA CCACCGATAG CAGTACCCTT 
ACTATCACTG CCTCACCCCC TCTAACTACT GCCACTGGTA GCTTGGGCAT TGACTTGAAA 
GAGCCCATTT ATACACAAAA TGGAAAACTA GGACTAAAGT ACGGGGCTCC TTTGCATGTA 
ACAGACGACC TAAACACTTT GACCGTAGCA ACTGGTCCAG GTGTGACTAT TAATAATACT 
TCCTTGCAAA CTAAAGTTAC TGGAGCCTTG GGTTTTGATT CACAAGGCAA TATGCAACTT 
AATGTAGCAG GAGGACTAAG GATTGATTCT CAAAACAGAC GCCTTATACT TGATGTTAGT 
TATCCGTTTG ATGCTCAAAA CCAACTAAAT CTAAGACTAG GACAGGGCCC TCTTTTTATA 
AACTCAGCCC ACAACTTGGA TATTAACTAC AACAAAGGCC TTTACTTGTT TACAGCTTCA 
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TGGCCAAAAC 


33660 




or"T*A nr A /*»A /^Ti 
G L X A X AC ACT 


^1 TV ^ T( TV ^^^^ 

G CAGGGAACC 


GGGACTGGAA 


CAATGACAGT 


GGAGAGCCCA 


33720 


\3\3tW^ X X MM 


^0 A*l A T^O TV 

C C A X GG AT C A 


TC ATG CTCGT 


CATGATATCA 


ATGTTGGCAC 


AACACAGGCA 


33780 


v«M\wVa X Vj^M 1 M 


CAuTTCCTCA 


^9^V^nTfWTt ^ ^ 

GGATTACAAG 


CTCCTCCCGC 


GTTAGAACCA 


TATCCCAGGG 


33840 


AACAACCCAT 


TCCTGAATCA 


GCGTAAATCC 


CACACTGCAG 


GGAAGACCTC 


GCACGTAACT 


33900 


CACGTTGTGC 


ATTGTCAAAG 


TGTTACATTC 


GGGCAGCAGC 


GGATGATCCT 


CCAGTATGGT 


33960 


AGCGCGGGTT 


TCTGTCTCAA 


AAGGAGGTAG 


ACGATCCCTA 


CTGTACGGAG 


TGCGCCGAGA 


34020 


CAACCGAGAT 


CGTGTTGGTC 


GTAGTGTCAT 


GCCAAATGGA 


ACGCCGGACG 


TAGTCATATT 


34080 
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TCCTGAAGCA AAACCAGGTG CGGGCGTGAC AAACAGATCT GCGTCTCCGG TCTCGCCGCT 34140 
TAGATCGCTC TGTGTAGTAG TTGTAGTATA TCCACTCTCT CAAAGCATCC AGGCGCCCCC 34200 
TGGCTTCGGG TTCTATGTAA ACTCCTTCAT GCGCCGCTGC CCTGATAACA TCCACCACCG 34260 
CAGAATAAGC CACACCCAGC CAACCTACAC ATTCGTTCTG CGAGTCACAC ACGGGAGGAG 34320 
CGGGAAGAGC TGGAAGAACC ATGTTTTTTT TTTTATTCCA AAAGATTATC CAAAACCTCA 34380 
AAATGAAGAT CTATTAAGTG AACGCGCTCC CCTCCGGTGG CGTGGTCAAA CTCTACAGCC 34440 
AAAGAACAGA TAATGGCATT TGTAAGATGT TGCACAATGG CTTCCAAAAG GCAAACGGCC 34500 
CTCACGTCCA AGTGGACGTA AAGGCTAAAC CCTTCAGGGT GAATCTCCTC TATAAACATT 34560 
CCAGCACCTT CAACCATGCC CAAATAATTC TCATCTCGCC ACCTTCTCAA TATATCTCTA 34620 
AGCAAATCCC GAATATTAAG TCCGGCCATT GTAAAAATCT GCTCCAGAGC GCCCTCCACC 34680 
TTCAGCCTCA AGCAGCGAAT CATGATTGCA AAAATTCAGG TTCCTCACAG ACCTGTATAA 34740 
GATTCAAAAG CGGAACATTA ACAAAAATAC CGCGATCCCG TAGGTCCCTT CGCAGGGCCA 34800 
GCTGAACATA ATCGTGCAGG TCTGCACGGA CCAGCGCGGC CACTTCCCCG CCAGGAACCT 34860 
TGACAAAAGA ACCCACACTG ATTATGACAC GCATACTCGG AGCTATGCTA ACCAGCGTAG 34920 
CCCCGATGTA AGCTTTGTTG CATGGGCGGC GATATAAAAT GCAAGGTGCT GCTCAAAAAA 34980 
TCAGGCAAAG CCTCGCGCAA AAAAGAAAGC ACATCGTAGT CATGCTCATG CAGATAAAGG 35040 
CAGGTAAGCT CCGGAACCAC CACAGAAAAA GACACCATTT TTCTCTCAAA CATGTCTGCG 3 5100 
GGTTTCTGCA TAAACACAAA ATAAAATAAC AAAAAAACAT TTAAACATTA GAAGCCTGTC 35160 
TTACAACAGG AAAAACAACC CTTATAAGCA TAAGACGGAC TACGGCCATG CGGGCGTGAC 
CGTAAAAAAA CTGGTCACCG TGATTAAAAA GCACCACCGA CAGCTCCTCG GTCATGTCCG 
GAGTCATAAT GTAAGACTCG GTAAACACAT CAGGTTGATT CATCGGTCAG TGCTAAAAAG 
CGACCGAAAT AGCCCGGGGG AATACATACC CGCAGGCGTA GAGACAACAT TACAGCCCCC 
ATAGGAGGTA TAACAAAATT AATAGGAGAG AAAAACACAT AAACACCTGA AAAACCCTCC 35460 
TGCCTAGGCA AAATAGCACC CTCCCGCTCC AGAACAACAT ACAGCGCTTC ACAGCGGCAG 35520 
CCTAACAGTC AGCCTTACCA GTAAAAAAGA AAACCTATTA AAAAAACACC ACTCGACACG 
GCACCAGCTC AATCAGTCAC AGTGTAAAAA AGGGCCAAGT GCAGAGCGAG TATATATAGG 
ACTAAAAAAT GACGTAACGG TTAAAGTCCA CAAAAAACAC CCAGAAAACC GCACGCGAAC 
CTACGCCCAG AAACGAAAGC CAAAAAACCC ACAACTTCCT CAAATCGTCA CTTCCGTTTT 
CCCACGTTAC GTAACTTCCC ATTTTAAGAA AACTACAATT CCCAACACAT ACAAGTTACT 
CCGCCCTAAA ACCTACGTCA CCCGCCCCGT TCCCACGCCC CGCGCCACGT CACAAACTCC 
ACCCCCTCAT TATCATATTG GCTTCAATCC AAAATAAGGT ATATTATTGA TGATG 
(2) INFORMATION FOR SEQ ID NO: 2: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 25 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

( D ) TOPOLOGY : 1 inear 



35220 
35280 
35340 
35400 
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35760 
35820 
35880 
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(ii) MOLECULE TYPE: other nucleic acid 
(A) DESCRIPTION: /desc = "DNA" 

(xi) SEQUENCE DESCRIPTION: SEQ ID N0:2: 

TTCATTTTAT GTTTCAGGTT CAGGG 

(2) INFORMATION FOR SEQ ID NO: 3: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 20 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: Other nucleic acid 
(A) DESCRIPTION: /desc = "DNA" 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 3: 

TTACCGCCAC ACTCGCAGGG 

(2) INFORMATION FOR SEQ ID NO: 4: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 343 03 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: other nucleic acid 
(A) DESCRIPTION: /desc = "DNA" 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 4: 



20 



TTATTTTGGA 


TTGAAGCCAA 


TATGATAATG 


AGGGGGTG6A 


GTTTGTGACG 


TGGCGCGGGG 


60 


CGTGGGAACG 


GGGCGGGTGA 


CGTAGTAGTG 


TGGCGGAAGT 


GTGATGTTGC 


AAGTGTGGCG 


120 


GAACACATGT 


AAGCGACGGA 


TGTGGCAAAA 


GTGACGTTTT 


TGGTGTGCGC 


CGGATCCACA 


180 


GGACGGGTGT 


GGTCGCCATG 


ATCGCGTAGT 


CGATAGTGGC 


TCCAAGTAGC 


GAAGCGAGCA 


240 


GGACTGGGCG 


GCGGCCAAAG 


CGGTCGGACA 


GTGCTCCGAG 


AACGGGTGCG 


CATAGAAATT 


300 


GCATCAACGC 


ATATAGCGCT 


AGCAGCACGC 


CATAGTGACT 


GGCGATGCTG 


TCGGAATGGA 


360 


CGATATCCCG 


CAAGAGGCCC 


GGCAGTACCG 


GCATAACCAA 


GCCTATGCCT 


ACAGCATCCA 


420 


GGGTGACGGT 


GCCGAGGATG 


ACGATGAGCG 


CATTGTTAGA 


TTTCATACAC 


GGTGCCTGAC 


480 


TGCGTTAGCA 


ATTTAACTGT 


GATAAACTAC 


CGCATTAAAG 


CTTATCGATG 


ATAAGCTGTC 


540 


AAACATGAGA 


ATTCTTGAAG 


ACGAAAGGGC 


CTCGTGATAC 


GCCTATTTTT 


ATAGGTTAAT 


600 


GTCATGATAA 


TAATGGTTTC 


TTAGACGTCA 


GGTGGCACTT 


TTCGGGGAAA 


TGTGCGCGGA 


660 


ACCCCTATTT 


GTTTATTTTT 


CTAAATACAT 


TCAAATATGT 


ATCCGCTCAT 


GAGACAATAA 


720 


CCCTGATAAA 


TGCTTCAATA 


ATATTGAAAA 


AGGAAGAGTA 


TGAGTATTCA 


ACATTTCCGT 


780 


GTCGCCCTTA 


TTCCCTTTTT 


TGCGGCATTT 


TGCCTTCCTG 


TTTTTGCTCA 


CCCAGAAACG 


840 


CTGGTGAAAG 


TAAAA6ATGC 


TGAAGATCAG 


TTGGGTGCAC 


GAGTGGGTTA 


CATCGAACTG 


900 


GATCTCAACA 


GCGGTAAGAT 


CCTTGAGAGT 


TTTCGCCCCG 


AAGAACGTTT 


TCCAATGATG 


960 


AGCACTTTTA 


AAGTTCTGCT 


ATGTGGCGCG 


GTATTATCCC 


GTGTTGACGC 


CGGGCAAGAG 


1020 


CAACTCGGTC 


GCCGCATACA 


CTATTCTCAG 


AATGACTTGG 


TTGAGTACTC 


ACCAGTCACA 


1080 
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AGTGATAACA CTGCGGCCAA CTTACTTCTG ACAACGATCG GAGGACCGAA GGAGCTAACC 
GCTTTTTTGC ACAACATGGG GGATCATGTA ACTCGCCTTG ATCGTTGGGA ACCGGAGCTG 
AATGAAGCCA TACCAAACGA CGAGCGTGAC ACCACGATGC CTGCAGCAAT GGCAACAACG 
TTGCGCAAAC TATTAACTGG CGAACTACTT ACTCTAGCTT CCCGGCAACA ATTAATAGAC 
TGGATGGAGG CGGATAAA6T TGCAGGACCA CTTCTGCGCT CGGCCCTTCC GGCTGGCTGG 
TTTATTGCTG ATAAATCTGG AGCCGGTGAG CGTGGGTCTC GCGGTATCAT TGCAGCACTG 
GGGCCAGATG GTAAGCCCTC CCGTATCGTA GTTATCTACA CGACGGGGAG TCAGGCAACT 
ATGGATGAAC GAAATAGACA GATCGCTGAG ATAGGTGCCT CACTGATTAA GCATTGGTAA 
CTGTCAGACC AAGTTTACTC ATATATACTT TAGATTGATT TAAAACTTCA TTTTTAATTT 
AAAAGGATCT AGGTGAAGAT CCTTTTTGAT AATCTCATGA CCAAAATCCC TTAACGTGAG 
TTTTCGTTCC ACTGAGCGTC AGACCCCGTA GAAAAGATCA AAGGATCTTC TTGAGATCCT 
TTTTTTCTGC GCGTAATCTG CTGCTTGCAA ACAAAAAAAC CACCGCTACC AGCGGTGGTT 
TGTTTGCCGG ATCAAGAGCT ACCAACTCTT TTTCCGAAGG TAACTGGCTT CAGCAGAGCG 
CAGATACCAA ATACTGTCCT TCTAGTGTAG CCGTAGTTAG GCCACCACTT CAAGAACTCT 
GTAGCACCGC CTACATACCT CGCTCTGCTA ATCCTGTTAC CAGTGGCTGC TGCCAGTGGC 
GATAAGTCGT GTCTTACCGG GTTGGACTCA AGACGATAGT TACCGGATAA GGCGCAGCGG 
TCGGGCTGAA CGGGGGGTTC GTGCACACAG CCCAGCTTGG AGCGAACGAC CTACACCGAA 
CTGAGATACC TACAGCGTGA GCATTGAGAA AGCGCCACGC TTCCCGAAGG GAGAAAGGCG 
GACAGGTATC CGGTAAGCGG CAGGGTCGGA ACAGGAGAGC GCACGAGGGA GCTTCCAGGG 
GGAAACGCCT GGTATCTTTA TAGTCCTGTC GGGTTTCGCC ACCTCTGACT TGAGCGTCGA 
TTTTTGTGAT GCTCGTCAGG GGGGCGGAGC CTATGGAAAA ACGCCAGCAA CGCGGCCTTT 
TTACGGTTCC TGGCCTTTTG CTGGCCTTTT GCTCACATGT TCTTTCCTGC GTTATCCCCT 
GATTCTGTGG ATAACCGTAT TACCGCCTTT GAGTGAGCTG ATACCGCTCG CCGCAGCCGA 
ACGACCGAGC GCAGCGAGTC AGTGAGCGAG GAAGCGGAAG AGCGCCTGAT GCGGTATTTT 
CTCCTTACGC ATCTGTGCGG TATTTCACAC CGCATATGGT GCACTCTCAG TACAATCTGC 
TCTGATGCCG CATAGTTAAG CCAGTATACA CTCCGCTATC GCTACGTGAC TGGGTCATGG 
CTGCGCCCCG ACACCCGCCA ACACCCGCTG ACGCGCCCTG ACGGGCTTGT CTGCTCCCGG 
CATCCGCTTA CAGACAAGCT GTGACCGTCT CCGGGAGCTG CATGTGTCAG AGGTTTTCAC 
CGTCATCACC GAAACGCGCG AGGCAGTCTA GACAATAGTA GTACGGATAG CTGTGACTCC 
GGTCCTTCTA ACACACCTCC TGAGATACAC CCGGTGGTCC CGCTGTGCCC CATTAAACCA 
GTTGCCG7GA GAGTTGGTGG GCGTCGCCAG GCTGTGGAAT GTATCGAGGA CTTGCTTAAC 
GAGCCTGGGC AACCTTTGGA CTTGAGCTGT AAACGCCCCA GGCCATAAGG TGTAAACCTG 
TGATTGCGTG TGTGGTTAAC GCCTTTGTTT GCT6AATGAG TTGATGTAAG TTTAATAAAG 
GGTGAGATAA TGTTTAACTT GCATGGCGTG TTAAATGGGG CGGGGCTTAA AGGGTATATA 
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ATGCGCCGTG 


GGCTAATCTT 


GGTTACATCT 


GACCTCATGG 


AGGCTTGGGA 


GTGTTTGGAA 


3240 


GATTTTTCTG 


CTGTGCGTAA 


CTTGCTGGAA 


CAGAGCTCTA 


ACAGTACCTC 


TTGGTTTTGG 


3300 


AGGTTTCTGT 


GGGGCTCATC 


CCAGGCAAAG 


TTAGTCTGCA 


GAATTAAGGA 


GGATTAGAAG 


3360 


TGGGAATTTG 


AAGAGCTTTT 


GAAATCCTGT 


GGTGAGCTGT 


TTGATTCTTT 


GAATCTGGGT 


3420 


CACCAGGCGC 


TTTTCCAAGA 


GAAGGTCATC 


AAGACTTTGG 


ATTTTTCCAC 


ACCGGGGCGC 


3480 


GCTGCGGCTG 


CTGTTGCTTT 


TTTGAGTTTT 


ATAAAGGATA 


AATGGAGCGA 


AGAAACCCAT 


3540 


CTGAGCGGGG 


GGTACCTGCT 


GGATTTTCTG 


GCCATGCATC 


TGTGGAGAGC 


GGTTGTGAGA 


3600 


CACAAGAATC 


GCCTGCTACT 


GTTGTCTTCC 


GTCCGCCCGG 


CGATAATACC 


GACGGAGGAG 


3660 


CAGCAGCAGC 


AGCAGGAGGA 


AGCCAGGCGG 


CGGCGGCAGG 


AGCAGAGCCC 


ATGGAACCCG 


3720 


AGAGCCGGCC 


TGGACCCTCG 


GGAATGAATG 


TTGTACAGGT 


GGCTGAACTG 


TATCCAGAAC 


3780 


TGAGACGCAT 


TTTGACAATT 


ACAGAGGATG 


GGCAGGGGCT 


AAAGGGGGTA 


AAGAGGGAGC 


3840 


GGGGGGCTTG 


TGAGGCTACA 


GAGGAGGCTA 


GGAATCTAGC 


TTTTAGCTTA 


ATGACCAGAC 


3900 


ACCGTCCTGA 


GTGTATTACT 


TTTCAACAGA 


TCAAGGATAA 


TTGCGCTAAT 


GAGCTTGATC 


3960 


TGCTGGCGCA 


GAAGTATTCC 


ATAGAGCAGC 


TGACCACTTA 


CTGGCTGCAG 


CCAGGGGATG 


4020 


ATTTTGAGGA 


GGCTATTAGG 


GTATATGCAA 


AGGTGGCACT 


TAGGCCAGAT 


TGCAAGTACA 


4080 


AGATCAGCAA 


ACTTGTAAAT 


ATCAGGAATT 


GTTGCTACAT 


TTCTGGGAAC 


GGGGCCGAGG 


4140 


TGGAGATAGA 


TACGGAGGAT 


AGGGTGGCCT 


TTAGATGTAG 


CATGATAAAT 


ATGTGGCCGG 


4200 


GGGTGCTTGG 


CATGGACGGG 


GTGGTTATTA 


TGAATGTAAG 


GTTTACTGGC 


CCCAATTTTA 


4260 


GCGGTACGGT 


TTTCCTGGCC 


AATACCAACC 


TTATCCTACA 


CGGTGTAAGC 


TTCTATGGGT 


4320 


TTAACAATAC 


CTGTGTGGAA 


GCCTGGACCG 


ATGTAAGGGT 


TCGGGGCTGT 


GCCTTTTACT 


4380 


GCTGCTGGAA 


GGGGGTGGTG 


TGTCGCCCCA 


AAAGCAGGGC 


TTCAATTAAG 


AAATGCCTCT 


4440 


TTGAAAGGTG 


TACCTTGGGT 


ATCCTGTCTG 


AGGGTAACTC 


CAGGGTGCGC 


CACAATGTGG 


4500 


CCTCCGACTG 


TGGTTGCTTC 


ATGCTAGTGA 


AAAGCGTGGC 


TGTGATTAAG 


CATAACATGG 


4560 


TATGTGGCAA 


CTGCGAGGAC 


AGGGCCTCTC 


AGATGCTGAC 


CTGCTCGGAC 


GGCAACTGTC 


4620 


ACCTGCTGAA 


GACCATTCAC 


GTAGCCAGCC 


ACTCTCGCAA 


GGCCTGGCCA 


GTGTTTGAGC 


4680 


ATAACATACT 


GACCCGCTGT 


TCCTTGCATT 


TGGGTAACAG 


GAGGGGGGTG 


TTCCTACCTT 


4740 


ACCAATGCAA 


TTTGAGTCAC 


ACTAAGATAT 


TGCTTGAGCC 


CGAGAGCATG 


TCCAAGGTGA 


4800 


ACCTGAACGG 


GGTGTTTGAC 


ATGACCATGA 


AGATCTGGAA 


GGTGCTGAGG 


TACGATGAGA 


4860 


CCCGCACCAG 


GTGCAGACCC 


TGCGAGTGTG 


GCGGTAAACA 


TATTAGGAAC 


CAGCCTGTGA 


4920 


TGCTGGATGT 


GACCGAGGAG 


CTGAGGCCCG 


ATCACTTGGT 


GCTGGCCTGC 


ACCCGCGCTG 


4980 


AGTTTGGCTC 


TAGCGATGAA 


GATACAGATT 


GAGGTACTGA 


AATGTGTGGG 


CGTGGCTTAA 


5040 


GGGTGGGAAA 


GAATATATAA 


GGTGGGGGTC 


TTATGTAGTT 


TTGTATCTGT 


TTTGCAGCAG 


5100 


CCGCCGCCGC 


CATGAGCACC 


AACTCGTTTG 


ATGGAAGCAT 


TGTGAGCTCA 


TATTTGACAA 


5160 


CGCGCATGCC 


CCCATGGGCC 


GGGGTGCGTC 


AGAATGTGAT 


GGGCTCCAGC 


ATTGATGGTC 


5220 


GCCCCGTCCT 


GCCCGCAAAC 


TCTACTACCT TGACCTACGA GACCGTGTCT 


GGAACGCCGT 


5280 
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TGGAGACTGC AGCCTCCGCC GCCGCTTCAG CCGCTGCAGC CACCGCCCGC GGGATTGTGA 5340 
CTGACTTTGC TTTCCTGAGC CCGCTTGCAA GCAGTGCAGC TTCCCGTTCA TCCGCCCGCG 
ATGACAAGTT GACGGCTCTT TTGGCACAAT TGGATTCTTT GACCCGGGAA CTTAATGTCG 
TTTCTCAGCA GCTGTTGGAT CTGCGCCAGC AGGTTTCTGC CCTGAAGGCT TCCTCCCCTC 



5400 



CCAATGCGGT TTAAAACATA AATAAAAAAC CAGACTCTGT TTGGATTTGG ATCAAGCAAG 
TGTCTTGCTG TCTTTATTTA GGGGTTTTGC GCGCGCGGTA GGCCCGGGAC CAGCGGTCTC 
GGTCGTTGAG GGTCCTGTGT ATTTTTTCCA GGACGTGGTA AAGGTGACTC TGGATGTTCA 
GATACATGGG CATAAGCCCG TCTCTGGGGT GGAGGTAGCA CCACTGCAGA GCTTCATGCT 
GCGGGGTGGT GTTGTAGATG ATCCAGTCGT AGCAGGAGCG CTGGGCGTGG TGCCTAAAAA 
TGTCTTTCAG TAGCAAGCTG ATTGCCAGGG GCAGGCCCTT GGTGTAAGTG TTTACAAAGC 
GGTTAAGCTG GGATGGGTGC ATACGTGGGG ATATGAGATG CATCTTGGAC TGTATTTTTA 
GGTTGGCTAT GTTCCCAGCC ATATCCCTCC GGGGATTCAT GTTGTGCAGA ACCACCAGCA 
CAGTGTATCC GGTGCACTTG GGAAATTTGT CATGTAGCTT AGAAGGAAAT GCGTGGAAGA 
ACTTGGAGAC GCCCTTGTGA CCTCCAAGAT TTTCCATGCA TTCGTCCATA ATGATGGCAA 
TGGGCCCACG GGCGGCGGCC TGGGCGAAGA TATTTCTGGG ATCACTAACG TCATAGTTGT 
GTTCCAGGAT GAGATCGTCA TAGGCCATTT TTACAAAGCG CGGGCGGAGG GTGCCAGACT 
GCGGTATAAT GGTTCCATCC GGCCCAGGGG CGTAGTTACC CTCACAGATT TGCATTTCCC 
ACGCTTTGAG TTCAGATGGG GGGATCATGT CTACCTGCGG GGCGATGAAG AAAACGGTTT 
CCGGGGTAGG GGAGATCAGC TGGGAAGAAA GCAGGTTCCT GAGCAGCTGC GACTTACCGC 
AGCCGGTGGG CCCGTAAATC ACACCTATTA CCGGGTGCAA CTGGTAGTTA AGAGAGCTGC 
AGCTGCCGTC ATCCCTGAGC AGGGGGGCCA CTTCGTTAAG CATGTCCCTG ACTCGCATGT 
TTTCCCTGAC CAAATCCGCC AGAAGGCGCT CGCCGCCCAG CGATAGCAGT TCTTGCAAGG 
AAGCAAAGTT TTTCAACGGT TTGAGACCGT CCGCCGTAGG CATGCTTTTG AGCGTTTGAC 
CAAGCAGTTC CAGGCGGTCC CACAGCTCGG TCACCTGCTC TACGGCATCT CGATCCAGCA 
TATCTCCTCG TTTCGCGGGT TGGGGCGGCT TTCGCTGTAC GGCAGTAGTC GGTGCTCGTC 
CAGACGGGCC AGGGTCATGT CTTTCCACGG GCGCAGGGTC CTCGTCAGCG TAGTCTGGGT 
CACGGTGAAG GGGTGCGCTC CGGGCTGCGC GCTGGCCAGG GTGCGCTTGA GGCTGGTCCT 
GCTGGTGCTG AAGCGCTGCC GGTCTTCGCC CTGCGCGTCG GCCAGGTAGC ATTTGACCAT 
GGTGTCATAG TCCAGCCCCT CCGCGGCGTG GCCCTTGGCG CGCAGCTTGC CCTTGGAGGA 
GGCGCCGCAC GAGGGGCAGT GCAGACTTTT GAGGGCGTAG AGCTTGGGCG CGAGAAATAC 
CGATTCCGGG GAGTAGGCAT CCGCGCCGCA GGCCCCGCAG ACGGTCTCGC ATTCCACGAG 
CCAGGTGAGC TCTGGCCGTT CGGGGTCAAA AACCAGGTTT CCCCCATGCT TTTTGATGCG 
TTTCTTACCT CTGGTTTCCA TGAGCCGGTG TCCACGCTCG GTGACGAAAA GGCTGTCCGT 
GTCCCCGTAT ACAGACTTGA GAGGCCTGTC CTCGAGCGGT GTTCCGCGGT CCTCCTCGTA 
TAGAAACTCG GACCACTCTG AGACAAAGGC TCGCGTCCAG GCCAGCACGA AGGAGGCTAA 
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GTGGGAGGGG 


TAGCGGTCGT 


TGTCCACTAG 


GGGGTCCACT 


CGCTCCAGGG 


TGTGAAGACA 


7440 


CATGTCvj^^CC 


TCTTCGGCAT 


CAAGGAAGGT 


GATTGGTTTG 


TAGGTGTAGG 


CCACGTGACC 


7500 


GGGTGTTCCT 


GAAGGGGGGC 


TATAAAAGGG 


GGTGGGGGCG 


CGTTCGTCCT 


CACTCTCTTC 


7560 


CGCATCGCTG 


TCTGCGAGGG 


CCAGCTGTTG 


GGGTGAGTAC 


TCCCTCTGAA 


AAGCGGGCAT 


7620 


GACTTCTGCG 


CTAAGATTGT 


CAGTTTCCAA 


AAACGAGGAG 


GATTTGATAT 


TCACCTGGCC 


7680 


CGCGGTGATG 


CCTTTGAGGG 


TGGCCGCATC 


CATCTGGTCA 


GAAAAGACAA 


TCTTTTTGTT 


7740 


GTCAAGCTTG 


GTGGCAAACG 


ACCCGTAGAG 


GGCGTTGGAC 


AGCAACTTGG 


CGATGGAGCG 


7800 


CAGGGTTTGG 


TTTTTGTCGC 


GATCGGCGCG 


CTCCTTGGCC 


GCGATGTTTA 


GCTGCACGTA 


7860 


TTCGCGCGCA 


ACGCACCGCC 


ATTCGGGAAA 


GACGGTGGTG 


CGCTCGTCGG 


GCACCAGGTG 


7920 


CACGCGCCAA 


CCGCGGTTGT 


GCAGGGTGAC 


AAGGTCAACG 


CTGGTGGCTA 


CCTCTCCGCG 


7980 


TAGGCGCTCG 


TTGGTCCAGC 


AGAGGCGGCC 


GCCCTTGCGC 


GAGCAGAATG 


GCGGTAGGGG 


8040 


GTCTAGCTGC 


GTCTCGTCCG 


GGGGGTCTGC 


GTCCACGGTA 


AAGACCCCGG 


GCAGCAGGCG 


8100 


CGCGTCGAAG 


TAGTCTATCT 


TGCATCCTTG 


CAAGTCTAGC 


GCCTGCTGCC 


ATGCGCGGGC 


8160 


GGCAAGCGCG 


CGCTCGTATG 


GGTTGAGTGG 


GGGACCCCAT 


GGCATGGGGT 


GGGTGAGCGC 


8220 


GGAGGCGTAC 


ATGCCGCAAA 


TGTCGTAAAC 


GTAGAGGGGC 


TCTCTGAGTA 


TTCCAAGATA 


8280 


TGTAGGGTAG 


CATCTTCCAC 


CGCGGATGCT 


GGCGCGCACG 


TAATCGTATA 


GTTCGTGCGA 


8340 


GGGAGCGAGG 


AGGTCGGGAC 


CGAGGTTGCT 


ACGGGCGGGC 


TGCTCTGCTC 


GGAAGACTAT 


8400 


CTGCCTGAAG 


ATGGCATGTG 


AGTTGGATGA 


TATGGTTGGA 


CGCTGGAAGA 


CGTTGAAGCT 


8460 


GGCGTCTGTG 


AGACCTACCG 


CGTCACGCAC 


GAAGGAGGCG 


TAGGAGTCGC 


GCAGCTTGTT 


8520 


GACCAGCTCG 


GCGGTGACCT 


GCACGTCTAG 


GGCGCAGTAG 


TCCAGGGTTT 


CCTTGATGAT 


8580 


GTCATACTTA 


TCCTGTCCCT 


TTTTTTTCCA 


CAGCTCGCGG 


TTGAGGACAA 


ACTCTTCGCG 


8640 


GTCTTTCCAG 


TACTCTTGGA 


TCGGAAACCC 


GTCGGCCTCC 


GAACGGTAAG 


AGCCTAGCAT 


8700 


GTAGAACTGG 


TTGACGGCCT 


GGTAGGCGCA 


GCATCCCTTT 


TCTACGGGTA 


GCGCGTATGC 


8760 


CTGCGCGGCC 


TTCCGGAGCG 


AGGTGTGGGT 


GAGCGCAAAG 


GTGTCCCTGA 


CCATGACTTT 


8820 


GAGGTACTGG 


TATTTGAAGT 


CAGTGTCGTC 


GCATCCGCCC 


TGCTCCCAGA 


GCAAAAAGTC 


8880 


CGTGCGCTTT 


TTGGAACGCG 


GATTTGGCAG 


GGCGAAGGTG 


ACATCGTTGA 


AGAGTATCTT 


8940 


TCCCGCGCGA 


GGCATAAAGT 


TGCGTGTGAT 


GCGGAAGGGT 


CCCGQCACCT 


CGGAACGGTT 


9000 


GTTAATTACC 


TGGGCGGCGA 


GCACGATCTC 


GTCAAAGCCG 


TTGATGTTGT 


GGCCCACAAT 


9060 


GTAAAGTTCC 


AAGAAGCGCG 


GGATGCCCTT 


GATGGAAGGC 


AATTTTTTAA 


GTTCCTCGTA 


9120 


GGTGAGCTCT 


TCAGGGGAGC 


TGAGCCCGTG 


CTCTGAAAGG 


GCCCAGTCTG 


CAAGATGAGG 


9180 


GTTGGAAGCG 


ACGAATGAGC 


TCCACAGGTC 


ACGGGCCATT 


AGCATTTGCA 


GGTGGTCGCG 


9240 


AAAGGTCCTA 


AACTGGCGAC 


CTATGGCCAT 


TTTTTCTGGG 


GTGATGCAGT 


AGAAGGTAAG 


9300 


CGGGTCTTGT 


TCCCAGCGGT 


CCCATCCAAG 


GTTCGCGGCT 


AGGTCTCGCG 


CGGCAGTCAC 


9360 


TAGAGGCTCA 


TCTCCGCCGA 


ACTTCATGAC 


CAGCATGAAG 


GGCACGAGCT 


GCTTCCCAAA 


9420 


GGCCCCCATC 


CAAGTATAGG 


TCTCTACATC 


GTAGGTGACA 


AAGAGACGCT 


CGGTGCGAGG 


9480 
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ATGCGAGCCG ATCGGGAAGA ACTGGATCTC CCGCCACCAA TTGGAGGAGT GGCTATTGAT 9540 
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GTGGTGAAAG TAGAAGTCCC TGCGACGGGC CGAACACTCG TGCTGGCTTT TGTAAAAACG 
TGCGCAGTAC TGGCAGCGGT GCACGGGCTG TACATCCTGC ACGAGGTTGA CCTGACGACC 
GCGCACAAGG AAGCAGAGTG GGAATTTGAG CCCCTCGCCT GGCGGGTTTG GCTGGTGGTC 
TTCTACTTCG GCTGCTTGTC CTTGACCGTC TGGCTGCTCG AGGGGAGTTA CGGTGGATCG 
GACCACCACG CCGCGCGAGC CCAAAGTCCA GATGTCCGCG CGCGGCGGTC GGAGCTTGAT 
GACAACATCG CGCAGATGGG AGCTGTCCAT GGTCTGGAGC TCCCGCGGCG TCAGGTCAGG 
CGGGAGCTCC TGCAGGTTTA CCTCGCATAG ACGGGTCAGG GCGCGGGCTA GATCCAGGTG 
ATACGTAATT TCCAGGGGCT GGTTGGTGGC GGCGTCGATG GCTTGCAAGA GGCCGCATCC 
CCGCGGCGCG ACTACGGTAC CGCGCGGCGG GCGGTGGGCC GCGGGGGTGT CCTTGGATGA 
TGCATCTAAA AGCGGTGACG CGGGCGAGCC CCCGGAGGTA GGGGGGGCTC CGGACCCGCC 
GGGAGAGGGG GCAGGGGCAC GTCGGCGCCG CGCGCGGGCA GGAGCTGGTG CTGCGCGCGT 
AGGTTGCTGG CGAACGCGAC GACGCGGCGG TTGATCTCCT GAATCTGGCG CCTCTGCGTG 
AAGACGACGG GCCCGGTGAG CTTGAGCCTG AAAGAGAGTT CGACAGAATC AATTTCGGTG 
TCGTTGACGG CGGCCTGGCG CAAAATCTCC TGCACGTCTC CTGAGTTGTC TTGATAGGCG 
ATCTCGGCCA TGAACTGCTC GATCTCTTCC TCCTGGAGAT CTCCGCGTCC GGCTCGCTCC 
ACGGTGGCGG CGAGGTCGTT GGAAATGCGG GCCATGAGCT GCGAGAAGGC GTTGAGGCCT 
CCCTCGTTCC AGACGCGGCT GTAGACCACG CCCCCTTCGG CATCGCGGGC GCGCATGACC 
ACCTGCGCGA GATTGAGCTC CACGTGCCGG GCGAAGACGG CGTAGTTTCG CAGGCGCTGA 
AAGAGGTAGT TGAGGGTGGT GGCGGTGTGT TCTGCCACGA AGAAGTACAT AACCCAGCGT 
CGCAACGTGG ATTCGTTGAT ATCCCCCAAG GCCTCAAGGC GCTCCATGGC CTCGTAGAAG 
TCCACGGCGA AGTTGAAAAA CTGGGAGTTG CGCGCCGACA CGGTTAACTC CTCCTCCAGA 
AGACGGATGA GCTCGGCGAC AGTGTCGCGC ACCTCGCGCT CAAAGGCTAC AGGGGCCTCT 
TCTTCTTCTT CAATCTCCTC TTCCATAAGG GCCTCCCCTT CTTCTTCTTC TGGCGGCGGT 
GGGGGAGGGG GGACACGGCG GCGACGACGG CGCACCGGGA GGCGGTCGAC AAAGCGCTCG 
ATCATCTCCC CGCGGCGACG GCGCATGGTC TCGGTGACGG CGCGGCCGTT CTCGCGGGGG 
CGCAGTTGGA AGACGCCGCC CGTCATGTCC CGGTTATGGG TTGGCGGGGG GCTGCCATGC 
GGCAGGGATA CGGCGCTAAC GATGCATCTC AACAATTGTT GTGTAGGTAC TCCGCCGCCG 
AGGGACCTGA GCGAGTCCGC ATCGACCGGA TCGGAAAACC TCTCGAGAAA GGCGTCTAAC 
CAGTCACAGT CGCAAGGTAG GCTGAGCACC GTGGCGGGCG GCAGCGGGCG GCGGTCGGGG 
TTGTTTCTGG CGGAGGTGCT GCTGATGATG TAATTAAAGT AGGCGGTCTT GAGACGGCGG 
ATGGTCGACA GAAGCACCAT GTCCTTGGGT CCGGCCTGCT GAATGCGCAG GCGGTGGGCC 
ATGCCCCAGG CTTCGTTTTG ACATCGGCGC AGGTCTTTGT AGTAGTCTTG CATGAGCCTT 
TCTACCGGCA CTTCTTCTTC TCCTTCCTCT TGTCCTGCAT CTCTTGCATC TATCGCTGCG 
GCGGCGGCGG AGTTTGGCCG TAGGTGGCGC CCTCTTCCTC CCATGCGTGT GACCCCGAAG 
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GCTGAAGCAG. GGCTAGGTCG 


GCGACAACGC 


GCTCGGCTAA 


TATGGCCTGC 


11640 




TGAGGGTAGA 


CTGGAAGTCA 


TCCATGTCCA 


CAAAGCGGTG 


GTATGCGCCC 


11700 




TGTAAGTGCA 


GTTGGCCATA 


ACGGACCAGT 


TAACGGTCTG 


GTGACCCGGC 


11760 




CGGTGTACCT 


GAGACGCGAG 


TAAGCCCTCG 


AGTCAAATAC 


GTAGTCGTTG 


11820 




CCAGGTACTG 


GTATPPPAPP 


AAAAAGTGCG 


GCGGCGGCTG 


GCGGTAGAGG 


11880 




GGGTGGCCGG 


www X W wwOw\7 


GCGAGATCTT 


CCAACATAAG 


GCGATGATAT 


11940 


C uGTAG ATGT 


ACCTGGACAT 


PP Af3GTG A Tn 


CCGGCGGCGG 


TGGTGGAGGC 


GCGCGGAAAG 


12000 


rCGCGGACGC 


GGTTCCAGAT 


GTTGCGCAGP 


GGCAAAAAGT 


GCTCCATGGT 


CGGGACGCTC 


12060 


1 ^oUL-Vju i CA 


GGCGCGCGCA 


ATCGTTGAPG 

A wV3 X X ^7#\w^9 


CTCTACCGTG 


CAAAAGGAGA 


GCCTGTAAGC 


12120 


V7Viv>(. Au 1 L r r 


CCGTGGTCTG 


GTGGATAAAT 


TCGCAAGGGT 


ATCATGGCGG 


ACGACCGGGG 


12180 


•prp ^ /-» JV /-I 

1 iL.UA\juLLL 


GTATCCGGCC 


GTPPGPPGTG 

w X \w www Ww X w 


ATCCATGCGG 


TTACCGCCCG 


CGTGTCGAAC 


12240 


LUAQaVj 1 Cj rCaL 


GACGTCAGAC 


AAPGGGGGAG 


TGCTCCTTTT 


GGCTTCCTTC 


CAGGCGCGGC 


12300 


CjCjCTGC J.GCG 


CTAGCTTTTT 


TGGPPAPTnrs 


CCGCGCGCAG 


CGTAAGCGGT 


TAGGCTGGAA 


12360 


AG C G AAAG C A 


TTAAGTGGPT 


prsPTPPr'Tr'T 


AGCCGGAGGG 


TTATTTTCCA 


AGGGTTGAGT 


12420 


^^^^^^^^ ^^^4 
CGCGGGACCC 




1 w 1 %^oV3AL>v.o 


GCCGGACTGC 


GGCGAACGGG 


GGTTTGCCTC 


12480 


CCCGTCaTGC 




1 XV3\»A/Wll iC 


CTCCGGAAAC 


AGGGACGAGC 


CCCTTTTTTG 


12540 


CTTTTCCCAG 




1 1 ^CoGCA 


GATGCGCCCC 


CCTCCTCAGC 


AGCGGCAAGA 


12600 


GCAAGAGCAG 




^t^AvjVjbUACC 


CTCCCCTCCT 


CCTACCGCGT 


CAGGAGGGGC 


12660 


GACATCCGCG 


GTT(^ 2v p nrrir! 


v-AVaUAVsATGG 


TGATTACGAA 


CCCCCGCGGC 


GCCGGGCCCG 


12720 


GCACTACCTG 


GAPTTGGiVGG 




CCTGGCGCGG 


CTAGGAGCGC 


CCTCTCCTGA 


12760 


^l^t^^^^^n^ ^^^^^^^ 

GCGGTACCCA 


AGGGTGPAGP 


1 ur/v\\j>CW7 X VjA 


TACGCGTGAG 


GCGTACGTGC- 


CGCGGCAGAA 


12840 


CCTGTTTCGC 


GAPPGPGAf^rZ 




CGAGGAGATG 


CGGGATCGAA 


AGTTCCACGC 


12900 


AGGGCGCGAG 


CTGCGGPATG 


GPPTr! A ATTTS 
UWL.X vsnnl wu 


CGAGCGGTTG 


CTGCGCGAGG 


AGGACTTTGA 


12960 


^^^^^^ /^^^^ 

GCCCGACGCG 


CGAACCGGGA 


TT AGTPP Pft r* 
X X nw X V« ^ 


GCGCGCACAC 


GTGGCGGCCG 


CCGACCTGGT 


13020 


AACCG catac 


GAGCAGAPGG 


TGAAPPAfiPa 


GATTAACTTT 


CAAAAAAGCT 


TTAACAACCA 


13080 


CGTGCGTACG 


CTTGTGGCGC 


GCGAGGAGGT 


GGCTATAGGA 


CTGATGCATC 


TGTGGGACTT 


13140 


TGTAAGCGCG 


CTGGAGCAAA 


ACCCAAATAG 


CAAGCCGCTC 


ATGGCGCAGC 


TGTTCCTTAT 


13200 


AGTGCAGCAC 


AGCAGGGACA 


ACGAGGCATT 


CAGGGATGCG 


CTGCTAAACA 


TAGTAGAGCC 


13260 


CGAGGGCCGC 


TGGCTGCTCG 


ATTTGATAAA 


CATCCTGCAG 


AGCATAGTGG 


TGCAGGAGCG 


13320 


CAGCTTGAGC 


CTGGCTGACA AGGTGGCCGC 


CATCAACTAT 


TCCATGCTTA 


GCCTGGGCAA 


13380 


GTTTTACGCC 


CGCAAGATAT 


ACCATACCCC 


TTACGTTCCC 


ATAGACAAGG 


AGGTAAAGAT 


13440 


CGAGGGGTTC 


TACATGCGCA 


TGGCGCTGAA 


GGTGCTTACC 


TTGAGCGACG 


ACCTGGGCGT 


13500 


TTATCGCAAC 


GAGCGCATCC 


ACAAGGCCGT 


GAGCGTGAGC 


CGGCGGCGCG 


AGCTCAGCGA 


13560 


CCGCGAGCTG 


ATGCACAGCC 


TGCAAAGGGC 


CCTGGCTGGC 


ACGGGCAGCG 


GCGATAGAGA 


13620 


GGCCGAGTCC 


TACTTTGACG 


CGGGCGCTGA 


CCTGCGCTGG 


GCCCCAAGCC 


GACGCGCCCT 


13680 
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GGAGGCAGCT GGGGCCGGAC CTGGGCTGGC GGTGGCACCC GCGCGCGCTG GCAACGTCGG 13740 

CGGCGTGGAG GAATATGACG AGGACGATGA GTACGAGCCA GAGGACGGCG AGTACTAAGC 13800 

GGTGATGTTT CTGATCAGAT GATGCAAGAC GCAACGGACC CGGCGGTGCG GGCGGCGCTG 13860 

CAGAGCCAGC CGTCCGGCCT TAACTCCACG GACGACTGGC GCCAGGTCAT GGACCGCATC 13920 

ATGTCGCTGA CTGCGCGCAA TCCTGACGCG TTCCGGCAGC AGCCGCAGGC CAACCGGCTC 13980 

TCCGCAATTC TGGAAGCGGT GGTCCCGGCG CGCGCAAACC CCACGCACGA GAAGGTGCTG 14040 

GCGATCGTAA ACGCGCTGGC CGAAAACAGG GCCATCCGGC CCGACGAGGC CGGCCTGGTC 14100 

TACGACGCGC TGCTTCAGCG CGTGGCTCGT TACAACAGCG GCAACGTGCA GACCAACCTG 14160 

GACCGGCTGG TGGGGGATGT GCGCGAGGCC GTGGCGCAGC GTGAGCGCGC GCAGCAGCAG 14220 

GGCAACCTGG GCTCCATGGT TGCACTAAAC GCCTTCCTGA GTACACAGCC CGCCAACGTG 14280 

CCGCGGGGAC AGGAGGACTA CACCAACTTT GTGAGCGCAC TGCGGCTAAT GGTGACTGAG 14340 

ACACCGCAAA GTGAGGTGTA CCAGTCTGGG CCAGACTATT TTTTCCAGAC CAGTAGACAA 14400 

GGCCTGCAGA CCGTAAACCT GAGCCAGGCT TTCAAAAACT TGCAGGGGCT GTGGGGGGTG 14460 

CGGGCTCCCA CAGGCGACCG CGCGACCGTG TCTAGCTTGC TGACGCCCAA CTCGCGCCTG 14520 

TTGCTGCTGC TAATAGCGCC CTTCACGGAC AGTGGCAGCG TGTCCCGGGA CACATACCTA 14580 

GGTCACTTGC TGACACTGTA GCGCGAGGCC ATAGGTCAGG CGCATGTGGA CGAGCATACT 14640 

TTCCAGGAGA TTACAAGTGT CAGCCGCGCG CTGGGGCAGG AGGACACGGG CAGCCTGGAG 14700 

GCAACCCTAA ACTACCTGCT GACCAACCGG CGGCAGAAGA TCCCCTCGTT GCACAGTTTA 14760 



AACAGCGAGG 


AGGAGCGCAT 


TTTGCGCTAC 


GTGCAGCAGA 


GCGTGAGCCT 


TAACCTGATG 


14820 


CGCGACGGGG 


TAACGCCCAG 


CGTGGCGCTG 


GACATGACCG 


CGCGCAACAT 


GGAACCGGGC 


14880 


ATGTATGCCT 


CAAACCGGCC 


GTTTATCAAC 


CGCCTAATGG 


ACTACTTGCA 


TCGCGCGGCC 


14940 


GCCGTGAACC 


CCGAGTATTT 


CACCAATGCC 


ATCTTGAACC 


CGCACTGGCT 


ACCGCCCCCT 


15000 


GGTTTCTACA 


CCGGGGGATT 


CGAGGTGCCC 


GAGGGTAACG 


ATGGATTCCT 


CTGGGACGAC 


15060 


ATAGACGACA 


GCGTGTTTTC 


CCCGCAACCG 


CAGACCCTGC 


TAGAGTTGCA 


ACAGCGCGAG 


15120 


CAGGCAGAGG 


CGGCGGTGCG 


AAAGGAAAGC 


TTCCGCAGGC 


CAAGCAGCTT 


GTCCGATCTA 


15180 


GGCGCTGCGG 


CCCCGCGGTC 


AGATGCTAGT 


AGCCCATTTC 


CAAGCTTGAT 


AGGGTCTCTT 


15240 


ACCAGCACTC 


GCACCACCCG 


CCCGCGCCTG 


CTGGGCGAGG 


AGGAGTACCT 


AAACAACTCG 


15300 


CTGCTGCAGC 


CGCAGCGCGA 


AAAAAACCTG 


CCTCCGGCAT 


TTCCCAACAA 


CGGGATAGAG 


15360 


AGCCTAGTGG 


ACAAGATGAG 


TAGATGGAAG 


ACGTACGCGC 


AGGAGCACAG 


GGACGTGCCA 


15420 


GGCCCGCGCC 


CGCCCACCCG 


TCGTCAAAGG 


CACGACCGTC 


AGCGGGGTCT 


GGTGTGGGAG 


15480 


GACGATGACT 


CGGCAGACGA 


CAGCAGCGTC 


CTGGATTTGG 


GAGGGAGTGG 


CAACCCGTTT 


15540 


GCGCACCTTC 


GCCCCAGGCT 


GGGGAGAATG 


TTTTAAAAAA 


AAAAAAGCAT 


GATGCAAAAT 


15600 


AAAAAACTCA 


CCAAGGCCAT 


GGCACCGAGC 


GTTGGTTTTC 


TTGTATTCCC 


CTTAGTATGC 


15660 


GGCGCGCGGC 


GATGTATGAG 


GAAGGTCCTC 


CTCCCTCCTA 


CGAGAGTGTG 


GTGAGCGCGG 


15720 


CGCCAGTGGC 


GGCGGCGCTG 


GGTTCTCCCT 


TCGATGCTCC 


CCTGGACCCG 


CCGTTTGTGC 


15780 
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GAAACAGCAT 


CCGTTACTCT 


PCTAJS97/19541 

GAGTTGGCAC 15840 


CCCTATTCGA 


CACCACCCGT 


GTGTACCTGG 


TGGACAACAA 


GTCAACGGAT 




1 c o A n 

15900 


TGAACTACCA 


GAACGACCAC 


AGCAACTTTC 


TGACCACGGT 


CATTCAAAAC 




15960 


GCCCGGGGGA 


GGCAAGCACA 


CAGACCATCA 


ATCTTGACGA 


CCGGTCGCAC 




16020 


ACCTGAAAAC 


CATCCTGCAT 


ACCAACATGC 


CAAATGTGAA 


CGAGTTCATG 




^ f\ f% f\ 
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AGTTTAAGGC 


GCGGGTGATG 


GTGTCGCGCT 


TGCCTACTAA 


GGACAATCAG 




lol40 


AATACGAGTG 


GGTGGAGTTC 


ACGCTGCCCG 


AGGGCAACTA 


CTCCGAGACC 






ACCTTATGAA 


CAACGCGATC 


GTGGAGCACT 


ACTTGAAAGT 


GGGCAGACAG 




1 o £ rv 


TGGAAAGCGA 


CATCGGGGTA 


AAGTTTGACA 


CCCGCAACTT 


CAGACTGGGG 




±b j2 O 


TCACTGGTCT 


TGTCATGCCT 


GGGGTATATA 


CAAACGAAGC 


CTTCCATCCA 




lo jBO 


TGCTGCCAGG 


ATGCGGGGTG 


GACTTCACCC 


ACAGCCGCCT 


GAGCAACTTG 




16440 


GCAAGCGGCA 


ACCCTTCCAG 


GAGGGCTTTA 


GGATCACCTA 


CGATGATCTG 




16500 


ACATTCCCGC 


ACTGTTGGAT 


GTGGACGCCT 


ACCAGGCGAG 


CTTGAAAGAT 


oAUALcCiAAC 


16560 


AGGGCGGGGG 


TGGCGCAGGC 


GGCAGCAACA 


GCAGTGGCAG 


CGGCGCGGAA 


oAoAACTCCA 


16620 


ACGCGGCAGC 


CGCGGCAATG 


CAGCCGGTGG 


AGGACATGAA 


CGATCATGCC 


ATTCGCGGCG 


16680 


ACACCTTTGC 


CACACGGGCT 


GAGGAGAAGC 


GCGCTGAGGC 


CGAAGCAGCG 


GCCGAAGCTG 


16740 


CCGCCCCCGC 


TGCGCAACCC 


GAGGTCGAGA 


AGCCTCAGAA 


GAAACCGGTG 


ATCAAACCCC 


16800 


TGACAGAGGA 


CAGCAAGAAA 


CGCAGTTACA 


ACCTAATAAG 


CAATGACAGC 


ACCTTCACCC 


16660 


AGTACCGCAG 


CTGGTACCTT 


GCATACAACT 


ACGGCQACCC 


TCAGACCGGA 


ATCCGCTCAT 


16920 


GGACCCTGCT 


TTGCACTCCT 


GACGTAACCT 


GCGGCTCGGA 


GCAGGTCTAC 


1 UGTCGTTGC 


169B0 


CAGACATGAT 


GCAAGACCCC 


GTGACCTTCC 


GCTCCACGCG 


CCAGATCAGC 


AAC-TTTCCGG 


17040 


TGGTGGGCGC 


CGAGCTGTTG 


CCCGTGCACT 


CCAAGAGCTT 


CTACAACGAC 


UAGGCCGTCT 


17100 


ACTCCCAACT 


CATCCGCCAG 


TTTACCTCTC 


TGACCCACGT 


GTTCAATCGC 


'VT*'T*r*r*r*/^ a a 
1 i iuu^GAGA 


17160 


ACCAGATTTT 


GGCGCGCCCG 


CCAGCCCCCA 


CCATCACCAC 


CGTCAGTGAA 


AAL.G 1 1 CCTG 


1 1 ft 
17220 


CTCTCACAGA 


TCACGGGACG 


CTACCGCTGC 


GCAACAGCAT 


CGGAGGAGTC 


UAGCGAGTGA 


^ ^ ft ft 
17280 


CCATTACTGA 


CGCCAGACGC 


CGCACCTGCC 


CCTACGTTTA 


CAAGGCCCTG 


GtaCATAGTCT 


17340 


CGCCGCGCGT 


CCTATCGAGC 


CGCACTTTTT 


GAGCAAGCAT 


GTCCATCCTT 


A 1 A 1 (.GL. C (J A 


^ ii ft ft 
17400 


GCAATAACAC 


AGGCTGGGGC 


CTGCGCTTCC 


CAAGCAAGAT 


GTTTGGCGGG 


GCUAAGAAGw 


17460 


GCTCCGACCA 


ACACCCAGTG 


CGCGTGCGCG 


GGCACTACCG 


CGCGCCCTGG 


GGL G CGCACA 


1 T ft 

17520 


AACGCGGCCG 


CACTGGGCGC 


ACCACCGTCG 


ATGACGCCAT 


CGACGCGGTG 


G i GGAGGAGG 


1 T ft ft 
17580 


CGCGCAACTA 


CACGCCCACG 


CCGCCACCAG 


TGTCCACAGT 


GGACGCGGCC 


ATTCAGACCG 


17640 


TGGTGCGCGG 


AGCCCGGCGC 


TATGCTAAAA 


TGAAGAGACG 


GCGGAGGCGC 


GTAGCACGTC 


17700 


GCCACCGCCG 


CCGACCCGGC 


ACTGCCGCCC 


AACGCGCGGC 


GGCGGCCCTG 


CTTAACCGCG 


17760 


CACGTCGCAC 


CGGCCGACGG 


GCGGCCATGC 


GGGCCGCTCG 


AAGGCTGGCC 


GCGGGTATTG 


17820 


TCACTGTGCC 


CCCCAGGTCC 


AGGCGACGAG 


CGGCCGCCGC 


AGCAGCCGCG 


GCCATTAGTG 


17880 
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CTATGACTCA GGGTCGCAGG GGCAACGTGT ATTGGGTGCG CGACTCGGTT AGCGGCCTGC 17940 
GCGTGCCCGT GCGCACCCGC CCCCCGCGCA ACTAGATTGC AAGAAAAAAC TACTTAGACT 18000 
CGTACTGTTG TATGTATCCA GCGGCGGCGG CGCGCAACGA AGCTATGTCC AAGCGCAAAA 18060 
TCAAAGAAGA GATGCTCCAG GTCATCGCGC CGGAGATCTA TGGCCCCCCG AAGAAGGAAG 18120 
AGCAGGATTA CAAGCCCCGA AAGCTAAAGC GGGTCAAAAA GAAAAAGAAA GATGATGATG 18180 
ATGAACTTGA CGACGAGGTG GAACTGCTGC ACGCTACCGC GCCCAGGCGA CGGGTACAGT 18240 
GGAAAGGTCG ACGCGTAAAA CGTGTTTTGC GACCCGGCAC CACCGTAGTC TTTACGCCCG 
GTGAGCGCTC CACCCGCACC TACAAGCGCG TGTATGATGA GGTGTACGGC GACGAGGACC 
TGCTTGAGCA GGCCAACGAG CGCCTCGGGG AGTTTGCCTA CGGAAAGCGG CATAAGGACA 
TGCTGGCGTT GCCGCTGGAC GAGGGCAACC CAACACCTAG CCTAAAGCCC GTAACACTGC 
AGCAGGTGCT GCCCGCGCTT GCACCGTCCG AAGAAAAGCG CGGCCTAAAG CGCGAGTCTG 
GTGACTTGGC ACCCACCGTG CAGCTGATGG TACCCAAGCG CCAGCGACTG GAAGATGTCT 
TGGAAAAAAT GACCGTGGAA CCTGGGCTGG AGCCCGAGGT CCGCGTGCGG CCAATCAAGC 
AGGTGGCGCC GGGACTGGGC GTGCAGACCG TGGACGTTCA GATACCCACT ACCAGTAGCA 
CCAGTATTGC CACCGCCACA GAGGGCATGG AGACACAAAC GTCCCCGGTT GCCTCAGCGG 
TGGCGGATGC CGCGGTGCAG GCGGTCGCTG CGGCCGCGTC CAAGACCTCT ACGGAGGTGC 
AAACGGACCC GTGGATGTTT CGCGTTTCAG CCCCCCGGCG CCCGCGCGGT TCGAGGAAGT 
ACGGCGCCGC CAGCGCGCTA CTGCCCGAAT ATGCCCTACA TCCTTCCATT GCGCCTACCC 
CCGGCTATCG TGGCTACACC TACCGCCCCA GAAGACGAGC AACTACCCGA CGCCGAACCA 
CCACTGGAAC CCGCCGCCGC CGTCGCCGTC GCCAGCCCGT GCTGGCCCCG ATTTCCGTGC 
GCAGGGTGGC TCGCGAAGGA GGCAGGACCC TGGTGCTGCC AACAGCGCGC TACCACCCCA 
GCATCGTTTA AAAGCCGGTC TTTGTGGTTC TTGCAGATAT GGCCCTCACC TGCCGCCTCC 
GTTTCCCGGT GCCGGGATTC CGAGGAAGTU^ TGCACCGTAG GAGGGGCATG GCCGGCCACG 
GCCTGACGGG CGGCATGCGT CGTGCGCACC ACCGGCGGCG GCGCGCGTCG CACCGTCGCA 
TGCGCGGCGG TATCCTGCCC CTCCTTATTC CACTGATCGC CGCGGCGATT GGCGCCGTGC 
CCGGAATTGC ATCCGTGGCC TTGCAGGCGC AGAGACACTG ATTAAAAACA AGTTGCATGT 
GGAAAAATCA AAATAAAAAG TCTGGACTCT CACGCTCGCT TGGTCCTGTA ACTATTTTGT 
AGAATGGAAG ACATCAACTT TGCGTCTCTG GCCCCGCGAC ACGGCTCGCG CCCGTTCATG 
GGAAACTGGC AAGATATCGG CACCAGCAAT ATGAGCGGTG GCGCCTTCAG CTGGGGCTCG 
CTGTGGAGCG GCATTAAAAA TTTCGGTTCC ACCGTTAAGA ACTATGGCAG CAAGGCCTGG 
AACAGCAGCA CAGGCCAGAT GCTGAGGGAT AAGTTGAAAG AGCAAAATTT CCAACAAAAG 
GTGGTAGATG GCCTGGCCTC TGGCATTAGC GGGGTGGTGG ACCTGGCCAA CCAGGCAGTG 
CAAAATAAGA TTAACAGTAA GCTTGATCCC CGCCCTCCCG TAGAGGAGCC TCCACCGGCC 
GTGGAGACAG TGTCTCCAGA GGGGCGTGGC GAAAAGCGTC CGCGCCCCGA CAGGGAAGAA 
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CCCACCACCC 
ACGCTGGACC 
GCCGTTGTTG 
TCGTTGCGGC 
GGGGTGCAAT 
CATGTATGCG 
AGATGGCTAC 
CCTCGGAGTA 
GCCTGAATAA 
GGTCCCAGCG 
ACAAGGCGCG 
ACTTTGACAT 
CCTACAACGC 
CTGCTCTTGA 
AAGCTGAGCA 
CAAAGGAGGG 
TTCAACCTGA 
CTGGGAGAGT 
CCACAAATGA 
GTCAAGTGGA 
TGACTCCTAA 
TTTCTTACAT 
TGCCCAACAG 
ACAGCACGGG 
ATTTGCAAGA 
GAACCAGGTA 
TTATTGAAAA 
TGATTAATAC 
AAAAAGATGC 
TGGAAATCAA 
ATTTGCCCGA 
CCTACGACTA 
TTGGAGCACG 
ATGCTGGCCT 
TCCAGGTGCC 



GTCCCATCGC 
TGCCTCCCCC 
TAACCCGTCC 
CCGTAGCCAG 
CCCTGAAGCG 
TCCATGTCGC 
CCCTTCGATG 
CCTGAGCCCC 
CAAGTTTAGA 
TTTGACGCTG 
GTTCACCCTA 
CCGCGGCGTG 
CCTGGCTCCC 
AATAAACCTA 
GCAAAAAACT 
TATTCAAATA 
ACCTCAAATA 
CCTTAAAAAG 
AAATGGAGGG 
AATGCAATTT 
AGTGGTATTG 
GCCCACTATT 
GCCTAATTAC 
TAATATGGGT 
CAGAAACACA 
CTTTTCTATG 
TCATGGAACT 
AGAGACTCTT 
TACAGAATTT 
TCTAAATGCC 
CAAGCTAAAG 
CATGAACAAG 
CTGGTCCCTT 
GCGCTACCGC 
TCAGAAGTTC 



GCCCATGGCT 
CGCCGACACC 
TAGCCGCGCG 
TGGCAACTGG 
CCGACGATGC 
CGCCAGAGGA 
ATGCCGCAGT 
GGGCTGGTGC 
AACCCCACGG 
CGGTTCATCC 
GCTGTGGGTG 
CTGGACAGGG 
AAGGGTGCCC 
GAAGAAGAGG 
CACGTATTTG 
GGTGTCGAAG 
GGAGAATCTC 
ACTACCCCAA 
CAAGGCATTC 
TTCTCAACTA 
TACAGTGAAG 
AAGGAAGGTA 
ATTGCTTTTA 
GTTCTGGCGG 
GAGCTTTCAT 
TGGAATCAGG 
GAAGATGAAC 
ACCAAGGTAA 
TCAGATAAAA 
AACCTGTGGA 
TACAGTCCTT 
CGAGTGGTGG 
GACTATATGG 
TCAATGTTGC 
TTTGCCATTA 



ACCGGAGTGC 
CAGCAGAAAC 
TCCCTGCGCC 
CAAAGCACAC 
TTCTGAATAG 
GCTGCTGAGC 
GGTCTTACAT 
AGTTTGCCCG 
TGGCGCCTAC 
CTGTGGACCG 
ATAACCGTGT 
GCCCTACTTT 
CAAATCCTTG 
ACGATGACAA 
GGCAGGCGCC 
GTCAAACACC 
AGTGGTACGA 
TGAAACCAT6 
TTGTAAAGCA 
CTGAGGCGAC 
ATGTAGATAT 
ACTCACGAGA 
GGGACAATTT 
GCCAAGCATC 
ACCAGCTTTT 
CTGTTGACAG 
TTCCAAATTA 
AACCTAAAAC 
ATGAAATAAG 
GAAATTTCCT 
CCAACGTAAA 
CTCCCGGGTT 
ACAACGTCAA 
TGGGCAATGG 
AAAACCTCCT 

-97- 



TGGGCCAGCA 
CTGTGCTGCC 
GCGCCGCCAG 
TGAACAGCAT 
CTAACGTGTC 
CGCCGCGCGC 
GCACATCTCG 
CGCCACCGAG 
GCACGACGTG 
TGAGGATACT 
GCTGGACATG 
TAAGCCCTAC 
CGAATGGGAT 
CGAAGACGAA 
TTATTCTGGT 
TAAATATGCC 
AACTGAAATT 
TTACGGTTCA 
ACAAAATGGA 
CGCAGGCAAT 
AGAAACCCCA 
ACTAATGGGC 
TATTGGTCTA 
GCAGTTGAAT 
GCTTGATTCC 
CTATGATCCA 
CTGCTTTCCA 
AGGTCAGGAA 
AGTTGGAAAT 
GTACTCCAAC 
AATTTCTGAT 
AGTGGACTGC 
CCCATTTAAC 
TCGCTATGTG 
TCTCCTGCCG 
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CACACCCGTA 


20040 


AGGCCCGACC 


20100 


CGGTCCGCGA 


20160 


CGTGGGTCTG 


20220 


GTATGTGTGT 


20280 


CCGCTTTCCA 


20340 


GGCCAGGACG 


20400 


ACGTACTTCA 


20460 


ACCACAGACC 


20520 


GCGTACTCGT 


20580 


GCTTCCACGT 


20640 


TCTGGCACTG 


20700 


GAAGCTGCTA 


20760 


GTAGACGAGC 


20820 


ATAAATATTA 


20880 


GATAAAACAT 


20940 


AATCATGCAG 


21000 


TATGCAAAAC 


21060 


AAGCTAGAAA 


21120 


GGTGATAACT 


21180 


GACACTCATA 


21240 


CAACAATCTA 


21300 


ATGTATTACA 


21360 


GCTGTTGTAG 


21420 


ATTGGTGATA 


21480 


GATGTTAGAA 


21540 


CTGGGAGGTG 


21600 


AATGGATGGG 


21660 


AATTTTGCCA 


21720 


ATAGCGCTGT 


21780 


AACCCAAACA 


21840 


TACATTAACC 


21900 


CACCACCGCA 


21960 


CCCTTCCACA 


22020 


GGCTCATACA 


22080 
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CCTACGAGTG GAACTTCAGG AAGGATGTTA ACATGGTTCT GCAGAGCTCC CTAGGAAATG 22140 



ACCTAAGGGT TGACGGAGCC AGCATTAAGT TTGATAGCAT TTGCCTTTAC GCCACCTTCT 
TCCCCATGGC CCACAACACC GCCTCCACGC TTGAGGCCAT GCTTAGAAAC GACACCAACG 
ACCAGTCCTT TAACGACTAT CTCTCCGCCG CCAACATGCT CTACCCTATA CCCGCCAACG 
CTACCAACGT GCCCATATCC ATCCCCTCCC GCAACTGGGC GGCTTTCCGC GGCTGGGCCT 
TCACGCGCCT TAAGACTAAG GAAACCCCAT CACTGGGCTC GGGCTACGAC CCTTATTACA 
CCTACTCTGG CTCTATACCC TACCTAGATG GAACCTTTTA CCTCAACCAC ACCTTTAAGA 
AGGTGGCCAT TACCTTTGAC TCTTCTGTCA GCTGGCCTGG CAATGACCGC CTGCTTACCC 
CCAACGAGTT TGAAATTAAG CGCTCAGTTG ACGGGGAGGG TTACAACGTT GCCCAGTGTA 
ACATGACCAA AGACTGGTTC CTGGTACAAA TGCTAGCTAA CTACAACATT GGCTACCAGG 
GCTTCTATAT CCCAGAGAGC TACAAGGACC GCATGTACTC CTTCTTTAGA AACTTCCAGC 
CCATGAGCCG TCAGGTGGTG GATGATACTA AATACAAGGA CTACCAACAG GTGGGCATCC 
TACACCAACA CAACAACTCT GGATTTGTTG GCTACCTTGC CCCCACCATG CGCGAAGGAC 
AGGCCTACCC TGCTAACTTC CCCTATCCGC TTATAGGCAA GACCGCAGTT GACAGCATTA 
CCCAGAAAAA GTTTCTTTGC GATCGCACCC TTTGGCGCAT CCCATTCTCC AGTAACTTTA 
TGTCCATGGG CGCACTCACA GACCTGGGCC AAAACCTTCT CTACGCCAAC TCCGCCCACG 
CGCTAGACAT GACTTTTGAG GTGGATCCCA TGGACGAGCC CACCCTTCTT TATGTTTTGT 
TTGAAGTCTT TGACGTGGTC CGTGTGCACC GGCCGCACCG CGGCGTCATC GAAACCGTGT 
ACCTGCGCAC GCCCTTCTCG GCCGGCAACG CCACAACATA AAGAAGCAAG CAACATCAAC 
AACAGCTGCC GCCATGGGCT CCAGTGAGCA GGAACTGAAA GCCATTGTCA AAGATCTTGG 
TTGTGGGCCA TATTTTTTGG GCACCTATGA CAAGCGCTTT CCAGGCTTTG TTTCTCCACA 
CAAGCTCGCC TGCGCCATAG TCAATACGGC CGGTCGCGAG ACTGGGGGCG TACACTGGAT 
GGCCTTTGCC TGGAACCCGC ACTCAAAAAC ATGCTACCTC TTTGAGCCCT TTGGCTTTTC 
TGACCAGCGA CTCAAGCAGG TTTACCAGTT TGAGTACGAG TCACTCCTGC GCCGTAGCGC 
CATTGCTTCT TCCCCCGACC GCTGTATAAC GCTGGAAAAG TCCACCCAAA GCGTACAGGG 
GCCCAACTCG GCCGCCTGTG GACTATTCTG CTGCATGTTT CTCCACGCCT TTGCCAACTG 
GCCCCAAACT CCCATGGATC ACAACCCCAC CATGAACCTT ATTACCGGGG TACCCAACTC 
CATGCTCAAC AGTCCCCAGG TACAGCCCAC CCTGCGTCGC AACCAGGAAC AGCTCTACAG 
CTTCCTGGAG CGCCACTCGC CCTACTTCCG CAGCCACAGT GCGCAGATTA GGAGCGCCAC 
TTCTTTTTGT CACTTGAAAA ACATGTAAAA ATAATGTACT AGAGACACTT TCAATAAAGG 
CAAATGCTTT TATTTGTACA CTCTCGGGTG ATTATTTACC CCCACCCTTG CCGTCTGCGC 
CGTTTAAAAA TCAAAGGGGT TCTGCCGCGC ATCGCTATGC GCCACTGGCA GGGACACGTT 
GCGATACTGG TGTTTAGTGC TCCACTTAAA CTCAGGCACA ACCATCCGCG GCAGCTCGGT 
GAAGTTTTCA CTCCACAGGC TGCGCACCAT CACCAACGCG TTTAGCAGGT CGGGCGCCGA 
TATCTTGAAG TCGCAGTTGG GGCCTCCGCC CTGCGCGCGC GAGTTGCGAT ACACAGGGTT 



22200 

22260 

22320 

22380 

22440 

22500 

22560 

22620 

22680 

22740 

22800 

22860 

22920 

22980 

23040 

23100 

23160 

23220 

23280 

23340 

23400 

23460 

23520 

23580 

23640 

23700 

23760 

23820 

23880 

23940 

24000 

24060 

24120 

24180 
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GCAGCACTGG 


AACACTATCA 


GCGCCGGGTG 


GTGCACGCTG 


GCCAGCACGC 
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TCTTGTCGGA 24240 


GATCAGATCC 


GCGTCCAGGT 


CCTCCGCGTT 


GCTCAGGGCG 


AACGGAGTCA 


X 1 1 1 Ava 


24300 


CTGCCTTCCC 


AAAAAGGGCG 


CGTGCCCAGG 


CTTTGAGTTG 


CACTCGCACC 




24360 


CAAAAGGTGA 


CCGTGCCCGG 


TCTGGGCGTT 


AGGATACAGC 


GCCTGCATAA 




2442 0 


CTGCTTAAAA 


GCCACCTGAG 


CCTTTGCGCC 


TTCAGAGAAG 


AACATGCCGC 


AAUAC i iGCC 


'\ A A o n 

24480 


GGAAAACTGA 


TTGGCCGGAC 


AGGCCGCGTC 


GTGCACGCAG 


CACCTTGCGT 


CGGTGTTGGA 


^ A C A 

24540 


GATCTGCACC 


ACATTTCGGC 


CCCACCGGTT 


CTTCACGATC 


TTGGCCTTGC 


TAGACTGCTC 


24600 


CTTCAGCGCG 


CGCTGCCCGT 


TTTCGCTCGT 


CACATCCATT 


TCAATCACGT 


GCTCCTTATT 


24660 


TATCATAATG 


CTTCCGTGTA 


GACACTTAAG 


CTCGCCTTCG 


ATCTCAGCGC 


AGCGGTGCAG 


24720 


CCACAACGCG 


CAGCCCGTGG 


GCTCGTGATG 


CTTGTAGGTC 


ACCTCTGCAA ACGACTGCAG 


24780 


GTACGCCTGC 


AGGAATCGCC 


CCATCATCGT 


CACAAAGGTC 


TTGTTGCTGG 


TGAAGGTCAG 


24840 


CTGCAACCCG 


CGGTGCTCCT 


CGTTCAGCCA 


GGTCTTGCAT 


ACGGCCGCCA 


GAGCTTCCAC 


24900 


TTGGTCAGGC 


AGTAGTTTGA 


AGTTCGCCTT 


TAGATCGTTA 


TCCACGTGGT 


ACTTGTCCAT 


24960 


CAGCGCGCGC 


GCAGCCTCCA 


TGCCCTTCTC 


CCACGCAGAC 


ACGATCGGCA 


CACTCAGCGG 


25020 


GTTCATCACC 


GTAATTTCAC 


TTTCCGCTTC 


GCTGGGCTCT 


TCCTCTTCCT 


CTTGCGTCCG 


25080 


CATACCACGC 


GCCACTGGGT 


CGTCTTCATT 


CAGCCGCCGC 


ACTGTGCGCT 


TACCTCCTTT 


25140 


GCCATGCTTG 


ATTAGCACCG 


GTGGGTTGCT 


GAAACCCACC 


ATTTGTAGCG 


CCACATCTTC 


25200 


TCTTTCTTCC 


TCGCTGTCCA 


CGATTACCTC 


TGGTGATGGC 


GGGCGCTCGG 


GCTTGGGAGA 


25260 


AGGGCGCTTC 


TTTTTCTTCT 


TGGGCGCAAT 


GGCCAAATCC 


GCCGCCGAGG 


TCGATGGCCG 


0^ ^ A A. 

25320 


CGGGCTGGGT 


GTGCGCGGCA 


CCAGCGCGTC 


TTGTGATGAG 


TCTTCCTCGT 


CCTCGGACTC 


25380 


GATACGCCGC 


CTCATCCGCT 


TTTTTGGGGG 


CGCCCGGGGA 


GGCGGCGGCG 


ACGGGGACGG 


25440 


GGACGACACG 


TCCTCCATGG 


TTGGGGGACG 


TCGCGCCGCA 


CCGCGTCCGC 


GCTCGGGGGT 


25500 


GGTTTCGCGC 


TGCTCCTCTT 


CCCGACTGGC 


CATTTCCTTC 


TCCTATAGGC 


AGAAAAAGAT 


25560 


CATGGAGTCA 


GTCGAGAAGA 


AGGACAGCCT 


AACCGCCCCC 


TCTGAGTTCG 


CCACCACCGC 


25620 


CTCCACCGAT 


GCCGCCAACG 


CGCCTACCAC 


CTTCCCCGTC 


GAGGCACCCC 


CGCTTGAGGA 


25680 


GGAGGAAGTG 


ATTATCGAGC 


AGGACCCAGG 


TTTTGTAAGC 


GAAGACGACG 


AGGACCGCTC 


25740 


AGTACCAACA 


GAGGATAAAA 


AGCAAGACCA 


GGACAACGCA 


GAGGCAAACG 


AGGAACAAGT 


25800 


CGGGCGGGGG 


GACGAAAGGC 


ATGGCGACTA 


CCTAGATGTG 


GGAGACGACG 


TGCTGTTGAA 


25860 


GCATCTGCAG 


CGCCAGTGCG 


CCATTATCTG 


CGACGCGTTG 


CAAGAGCGCA 


GCGATGTGCC 


25920 


CCTCGCCATA 


GCGGATGTCA 


GCCTTGCCTA 


CGAACGCCAC 


CTATTCTCAC 


CGCGCGTACC 


25980 


CCCCAAACGC 


CAAGAAAACG 


GCACATGCGA 


GCCCAACCCG 


CGCCTCAACT 


TCTACCCCGT 


26040 


ATTTGCCGTG 


CCAGAGGTGC 


TTGCCACCTA 


TCACATCTTT 


TTCCAAAACT 


GCAAGATACC 


26100 


CCTATCCTGC 


CGTGCCAACC 


GCAGCCGAGC 


GGACAAGCAG 


CTGGCCTTGC 


GGCAGGGCGC 


26160 


TGTCATACCT 


GATATCGCCT 


CGCTCAACGA 


AGTGCCAAAA 


ATCTTTGAGG 


GTCTTGGACG 


26220 


CGACGAGAAG 


CGCGCGGCAA 


ACGCTCTGCA 


ACAGGAAAAC 


AGCGAAAATG 


AAAGTCACTC 


26280 
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TGGAGTGTTG GTGGAACTCG AGGGTGACAA CGCGCGCCTA GCCGTACTAA AACGCAGCAT 26340 

CGAGGTCACC CACTTTGCCT ACCCGGCACT TAACCTACCC CCCAAGGTCA TGAGCACAGT 26400 

CATGAGTGAG CTGATCGTGC GCCGTGCGCA GCCCCTGGAG AGGGATGCAA ATTTGCAAGA 26460 

ACAAACAGAG GAGGGCCTAC CCGCAGTTGG CGACGAGCAG CTAGCGCGCT GGCTTCAAAC 26520 

GCGCGAGCCT GCCGACTTGG AGGAGCGACG CAAACTAATG ATGGCCGCAG TGCTCGTTAC 26580 

CGTGGAGCTT GAGTGCATGC AGCGGTTCTT TGCTGACCCG GAGATGCAGC GCAAGCTAGA 26640 

GGAAACATTG CACTACACCT TTCGACAGGG CTACGTACGC CAGGCCTGCA AGATCTCCAA 26700 

CGTGGAGCTC TGCAACCTGG TCTCCTACCT TGGAATTTTG CACGAAAACC GCCTTGGGCA 26760 

AAACGTGCTT CATTCCACGC TCAAGGGCGA GGCGCGCCGC GACTACGTCC GCGACTGCGT 26820 

TTACTTATTT CTATGCTACA CCTGGCAGAC GGCCATGGGC GTTTGGCAGC AGTGCTTGGA 26880 

GGAGTGCAAC CTCAAGGAGC TGCAGAAACT GCTAAAGCAA AACTTGAAGG ACCTATGGAC 26940 

GGCCTTCAAC GAGCGCTCCG TGGCCGCGCA CCTGGCGGAC ATCATTTTCC CCGAACGCCT 27000 

GCTTAAAACC CTGCAACAGG GTCTGCCAGA CTTCACCAGT CAAAGCATGT TGCAGAACTT 27060 

TAGGAACTTT ATCCTAGAGC GCTCAGGAAT CTTGCCCGCC ACCTGCTGTG CACTTCCTAG 27120 

CGACTTTGTG CCCATTAAGT ACCGCGAATG CCCTCCGCCG CTTTGGGGCC ACTGCTACCT 27180 

TCTGCAGCTA GCCAACTACC TTGCCTACCA CTCTGACATA ATGGAAGACG TGAGCGGTGA 27240 

CGGTCTACTG GAGTGTCACT GTCGCTGCAA CCTATGCACC CCGCACCGCT CCCTGGTTTG 27300 

CAATTCGCAG CTGCTTAACG AAAGTCAAAT TATCGGTACC TTTGAGCTGC AGGGTCCCTC 27360 

GCCTGACGAA AAGTCCGCGG CTCCGGGGTT GAAACTCACT CCGGGGCTGT GGACGTCGGC 27420 

TTACCTTCGC AAATTTGTAC CTGAGGACTA CCACGCCCAC GAGATTAGGT TCTACGAAGA 27480 

CCAATCCCGC CCGCCAAATG CGGAGCTTAC CGCCTGCGTC ATTACCCAGG GCCACATTCT 27540 

TGGCCAATTG CAAGCCATCA ACAAAGCCCG CCAAGAGTTT CTGCTACGAA AGGGACGGGG 27600 

GGTTTACTTG GACCCCCAGT CCGGCGAGGA GCTCAACCCA ATCCCCCCGC CGCCGCAGCC 27660 

CTATCAGCAG CAGCCGCGGG CCCTTGCTTC CCAGGATGGC ACCCAAAAAG AAGCTGCAGC 27720 

TGCCGCCGCC ACCCACGGAC GAGGAGGAAT ACTGGGACAG TCAGGCAGAG GAGGTTTTGG 27780 

ACGAGGAGGA GGAGGACATG ATGGAAGACT GGGAGAGCCT AGACGAGGAA GCTTCCGAGG 27840 

TCGAAGAGGT GTCAGACGAA ACACCGTCAC CCTCGGTCGC ATTCCCCTCG CCGGCGCCCC 27900 

AGAAATCGGC AACCGGTTCC AGCATGGCTA CAACCTCCGC TCCTCAGGCG CCGCCGGCAC 27960 

TGCCCGTTCG CCGACCCAAC CGTAGATGGG ACACCACTGG AACCAGGGCC GGTAAGTCCA 28020 

AGCAGCCGCC GCCGTTAGCC CAAGAGCAAC AACAGCGCCA AGGCTACCGC TCATGGCGCG 28080 

GGCACAAGAA CGCCATAGTT GCTTGCTTGC AAGACTGTGG GGGCAACATC TCCTTCGCCC 28140 

GCCGCTTTCT TCTCTACCAT CACGGCGTGG CCTTCCCCCG TAACATCCTG CATTACTACC 28200 

GTCATCTCTA CAGCCCATAC TGCACCGGCG GCAGCGGCAG CGGCAGCAAC AGCAGCGGCC 28260 
ACACAGAAGC AAAGGCGACC GGATAGCAAG ACTCTGACAA AGCCCAAGAA ATCCACAGCG 28320 
GCGGCAGCAG CAGGAGGAGG AGCGCTGCGT CTGGCGCCCA ACGAACCCGT ATCGACCCGC 28380 
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GAGCTTAGAA 


ACAGGATTTT 


TCCCACTCTG 


TATGCTATAT 


TTCAACAGAG 


CAGGGGCCAA 


28440 


GAACAAGAGC 


TGAAAATAAA 


AAACAGGTCT 


CTGCGATCCC 


TCACCCGCAG 


CTGCCTGTAT 


28500 


CACAAAAGCG 


AAGATCAGCT 


TCGGCGCACG 


CTGGAAGACG 


CGGAGGCTCT 


CTTCAGTAAA 


28560 


TACTGCGCGC 


TGACTCTTAA 


GGACTAGTTT 


CGCGCCCTTT 


CTCAAATTTA AGCGCGAAAA 


28620 


CTACGTCATC 


TCCAGCGGCC 


ACACCCGGCG 


CCAGCACCTG 


TCGTCAGCGC 


CATTATGAGC 


28660 


AAGGAAATTC 


CCACGCCCTA 


CATGTGGAGT 


TACCAGCCAC 


AAATGGGACT 


TGCGGCTGGA 


28740 


GCTGCCCAAG 


ACTACTCAAC 


CCGAATAAAC 


TACATGAGCG 


CGGGACCCCA 


CATGATATCC 


28800 


CGGGTCAACG 


GAATCCGCGC 


CCACCGAAAC 


CGAATTHTPT 


TGGAACAGGC 


GGCTATTACC 


28860 


ACCACACCTC 


GTAATAACCT 


TAATCCCCGT 


MUX XV3UIV»^^V9 


CTGCCCTGGT 


GTACCAGGAA 


28920 


AGTCCCGCTC 


CCACCACTGT 


GGTACTTCCC 




AGGCCGAAGT 


TCAGATGACT 


28980 


AACTCAGGGG 


CGCAGCTTGC 


GGGCGGCTTT 




TGCGGTCGCC 


CGGGCAGGGT 


29040 


ATAACTCACC 


TGACAATCAG 


AGGGCGAGGT 


ZV TTP AGPTP TV 


ACGACGAGTC 


GGTGAGCTrr 

WW X w<^wVw X WW 


29100 


TCGCTTGGTC 


TCCGTCCGGA 


CGGGACATTT 


P ZVG 2i TPfirSPP 


GCGCCGGCCG 


TCCTTPATTP 

X ^ X X X X w 


29160 


ACGCCTCGTC 


AGGCAATCCT 


AACTCTGCAG 


APPTPGTPPT 


CTGAGCCGCG 


CTCTGGAGGC 


29220 


ATTGGAACTC 


TGCAATTTAT 


TGAGGAGTTT 

X N3nV3\JAU XXX 




TCTACTTTAA 


CCCCTTCTCG 

vW X X w X Ww 


2928.0 


GGACCTCCCG 


GCCACTATCC 


GGATPAATTT 
ww<nx w^^/^x X X 


jv TTPPT A a PT* 


TTGACGCGGT 


AAAGGACTCG 


29340 


GCGGACGGCT 


ACGACTGAAT 


GTT AATT A AG 

wx xnnx X/U%nI 


TTPPT/^*rPP IV 


TCCGCACCCA 


CTATCTTCAT 

^ X rL X w X X X 


29400 


GTTGTTGCAG 


ATGAAGCGCG 


CAAGACCGTd 


TGZi AfiATAPP 


TTCAACCCCG 


TGTATCCATA 


29460 


TGACACGGAA 


ACCGGTCCTC 


CAAfTGTGPP 


X X X X L. X X X 


CCTCCCTTTG 


TATPPPPPA A 


29520 


TGGGTTTCAA 


GAGAGTCCCC 


CTGGGGTAfT 


PTPTTTHP^P 
v» XV,. X X XuLtoC 


CTATCCGAAC 


PTPTAGTTAP 
^ X V» X X X rv^ 


29580 


CTCCAATGGC 


ATGCTTGCGC 


TCAAAATGGG 

X Vw^t^W^/^ X \3\J\3 


p TV a nnnrT'Tr* 


TCTCTGGACG 


AGGPPnriPiv IV 


29640 


CCTTACCTCC 


CAAAATGTAA 


PPAfTGTGAG 
^ X O X 


X ^ X 


AAAAAAAPPA 


AGTPA Zi APAT 


29700 


AAACCTGGAA 


ATATCTGrAP 






GCCCTAACTG 


TGGPTGPPGP 

X WW\» X wVv^UL« 


29760 


CGCACCTCTA 




Gr'A AHAPIVPT 


\. AL> L ATGu AA 


TCACAGGCCC 


CGPTAAPPGT 


29620 


GCACGACTCC 




1 xV3^v.A^CL.A 




ACAGTGTCAG 


AAGGAAAGPT 


29880 


AGCCCTGCAA 








AGTACCCTTA 


CTATCAPTG P 

w X £^ X W^^^ X W W 


29940 


CTCACCCCCT 


CTAACTACTG 


CPArTGGTAG 


V-X XoovjCAX i 


GACTTGAAAG 


AGCCCATTTA 


30000 


TACACAAAAT 


GGAAAACTAG 


GACTAAAGTA 


PGGGGPTPPT 


T TGCAT GT AA 


CAGACGACCT 


30060 


AAACACTTTG 


ACCGTAGCAA 


CTGGTCPAGG 


XV3XV7AV»XAX 1 


AATAATACTT 


CCTTGCAAAC 


30120 


TAAAGTTACT 


GGAGCCTTGG 


GTTTTGATTP 

VJ X X X X V> A X X W 


aPHaGGP A 3VT 
nVaA/iVaoL. AA X 


ATGCAACTTA 


ATGTAGCAGG 


30180 


AGGACTAAGG 


ATTGATTCTC? 

mm A A A A m 


AAAAPAGACG 


^wX XAXA^i 1 


GATGTTAGTT 


ATCCGTTTGA 


30240 


i GCTCAAAAC 


CAACTAAATC 


TAAGACTAGG 


ACAGGGCCCT 


CTTTTTATAA 


ACTCAGCCCA 


30300 


CAACTTGGAT 


ATTAACTACA 


ACAAAGGCCT 


TTACTTGTTT 


ACAGCTTCAA 


ACAATTCCAA 


30360 


AAAGCTTGAG 


GTTAACCTAA 


GCACTGCC;^ 


GGGGTTGATG 


TTTGACGCTA 


CAGCCATAGC 


30420 


CATTAATGCA 


GGAGATGGGC 


TTGAATTTGG 


TTCACCTAAT 


GCACCAAACA 


CAAATCCCCT 


30480 
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CAAAACAAAA , 


ATTGGCCATG 


GCCTAGAATT 


TGATTCAAAC 


AAGGCTATGG 


TTCCTAAACT 


30540 


AGGAACTGGC 


CTTAGTTTTG 


ACAGCACAGG 


TGCCATTACA 


GTAGGAAACA 


AAAATAATGA 


30600 


TAAGCTAACT 


TTGTGGACCA 


CACCAGCTCC 


ATCTCCTAAC 


TGTAGACTAA 


ATGCAGAGAA 


30660 


AGATGCTAAA 


CTCACTTTGG 


TCTTAACAAA 


ATGTGGCAGT 


CAAATACTTG 


CTACAGTTTC 


30720 


AGTTTTGGCT 


GTTAAAGGCA 


GTTTGGCTCC 


AATATCTGGA 


ACAGTTCAAA 


GTGCTCATCT 


30780 


TATTATAAGA 


TTTGACGAAA 


ATGGAGTGCT 


ACTAAACAAT 


TCCTTCCTGG 


ACCCAGAATA 


30840 


TTGGAACTTT 


AGAAATGGAG 


ATCTTACTGA 


AGGCACAGCC 


TATACAAACG 


CTGTTGGATT 


30900 


TATGCCTAAC 


CTATCAGCTT 


ATCCAAAATC 


TCACGGTAAA 


ACTGCCAAAA 


GTAACATTGT 


30960 


CAGTCAAGTT 


TACTTAAACG 


GAGACAAAAC 


TAAACCTGTA 


ACACTAACCA 


TTACACTAAA 


31020 


CGGTACACAG 


GAAACAGGAG 


ACACAACTCC 


AAGTGCATAC 


TCTATGTCAT 


TTTCATGGGA 


31080 


CTGGTCTGGC 


CACAACTACA 


TTAATGAAAT 


ATTTGCCACA 


TCCTCTTACA 


CTTTTTCATA 


31140 


CATTGCCCAA 


GAATAAAGAA 


TCGTTTGTGT 


TATGTTTCAA 


CGTGTTTATT 


TTTCAATTGC 


31200 


AGAAAATTTC 


AAGTCATTTT 


TCATTCAGTA 


GTATAGCCCC 


ACCACCACAT 


AGCTTATACA 


31260 


GATCACCGTA 


CCTTAATCAA 


ACTCACAGAA 


CCCTAGTATT 


CAACCTGCCA 


CCTCCCTCCC 


31320 


7UVCACACAGA 


GTACACAGTC 


CTTTCTCCCC 


GGCTGGCCTT 


A7UUVAGCATC 


ATATCATGGG 


31380 


TAACAGACAT 


ATTCTTAGGT 


GTTATATTCC 


ACACGGTTTC 


CTGTCGAGCC 


AAACGCTCAT 


31440 


CAGTGATATT 


AATAAACTCC 


CCGGGCAGCT 


CACTTAAGTT 


CATGTCGCTG 


TCCAGCTGCT 


31500 


GAGCCACAGG 


CTGCTGTCCA 


ACTTGCGGTT 


GCTTAACGGG 


CGGCGAAGGA 


GAAGTCCACG 


31560 


CCTACATGGG 


GGTAGAGTCA 


TAATCGTGCA 


TCAGGATAGG 


GCGGTGGTGC 


TGCAGCAGCG 


31620 


CGCGAATAAA 


CTGCTGCCGC 


CGCCGCTCCG 


TCCTGCAGGA 


ATACAACATG 


GCAGTGGTCT 


31680 


CCTCAGCGAT 


GATTCGCACC 


GCCCGCAGCA 


TAAGGCGCCT 


TGTCCTCCGG 


GCACAGCAGC 


31740 


GCACCCTGAT 


CTCACTTAAA 


TCAGCACAGT 


AACTGCAGCA 


CAGCACCACA 


ATATTGTTCA 


31800 


AAATCCCACA 


GTGCAAGGCG 


CTGTATCCAA 


AGCTCATGGC 


GGGGACCACA 


GAACCCACGT 


31860 


GGCCATCATA 


CCACAAGCGC 


AGGTAGATTA 


AGTGGCGACC 


CCTCATAAAC 


ACGCTGGACA 


31920 


TAAACATTAC 


CTCTTTTGGC 


ATGTTGTAAT 


TCACCACCTC 


CCGGTACCAT 


ATAAACCTCT 


31980 


GATTAAACAT 


GGCGCCATCC 


ACCACCATCC 


TAAACCAGCT 


GGCCAAAACC 


TGCCCGCCGG 


32040 


CTATACACTG 


CAGGGAACCG 


GGACTGGAAC 


AATGACAGTG 


GAGAGCCCAG 


GACTCGTAAC 


32100 


CATGGATCAT 


CATGCTCGTC 


ATGATATCAA 


TGTTGGCACA 


ACACAGGCAC 


ACGTGCATAC 


32160 


ACTTCCTCAG 


GATTACAAGC 


TCCTCCCGCG 


TTAGAACCAT 


ATCCCAGGGA 


ACAACCCATT 


32220 


CCTGAATCAG 


CGTAAATCCC 


ACACTGCAGG 


GAAGACCTCG 


CACGTAACTC 


ACGTTGTGCA 


32280 


TTGTCAAAGT 


GTTACATTCG 


; GGCAGCAGCG 


GATGATCCTC 


CAGTATGGTA 


GCGCGGGTTT 


32340 


CTGTCTCAAA 


. AGGAGGTAGA CGATCCCTAC 


: TGTACGGAGT 


GCGCCGAGAC 


AACCGAGATC 


32400 


GTGTTGGTCG 


\ TAGTGTCATG CCAAATGGAA CGCCGGACGT 


AGTCATATTT 


CCTGAAGCAA 


32460 


AACCAGGTGC 


: GGGCGTGACA AACAGATCTG CGTCTCCGGT 


' CTCGCCGCTT 


AGATCGCTCT 


32520 


GTGTAGTAGT TGTAGTATAT CCACTCTCTC AAAGCATCCA 


. GGCGCCCCCT 


GGCTTCGGGT 


32580 
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TCTATGTAAA 


CTCCTTCATG 


CGCCGCTGCC 


CTGJXT A A P a T 




AGAATAAGCC 


32640 


ACACCCAGCC 


AACCTACACA 


TTCGTTCTGC 


GAGTCACJVCa 


Pfin^^ TV no a pp 


^^^^^^ TV TV TV 

vjGGAAGAGCT 


32700 


GGAAGAACCA 


TGTTTTTTTT 


TTTATTCCAA 


AAGATT ATP C 


ZiZi 2i UPPTPJi a 


A A ^n^* A TV TV ^n^^ 

AATGAAGATC 


32760 


TATTAAGTGA 


ACGCGCTCCC 


CTCCGGTGGC 


GTGGTCAAAC 


TPTapanppa 


R TV TV TV TV H fn 

AAbAACAGAT 


32820 


AATGGCATTT 


GTAAGATGTT 


GCACAATGGC 


TTCCAAAAfSn 


P Zi A 2i Pf2/2PP P 


1 v-ACGTCCAA 


32880 


GTGGACGTAA 


AGGCTAAACC 


CTTCAGGGTG 


AATPTPPT PT 


iilAAAV-Al xL. 


CAGCACCTTC 


32940 


AACCATGCCC 


AAATAATTCT 


CATCTCGCCA 


PPTTPTP Ji 

X X Vi. X\»AM1 




GCAAATCCCG 


33000 


AATATTAAGT 


CCGGCCATTG 


TAAAAATCTG 


PTPPAfilVfZPr! 


UCCTCCACCT 


TCAGCCTCAA 


33060 


GCAGCGAATC 


ATGATTGCAA 


AAATTCAGGT 


TPPTPa P & n & 


(.uToTATAAG 


ATTCAAAAGC 


33120 


GGAACATTAA 


CAAAAATACC 


GCGATCCCGT 


21fiGTPPPTTP 


vaCAGGGCCAG 


CTGAACATAA 


33180 


TCGTGCAGGT 


CTGCACGGAC 


CAGCGCGGCP 


2iPT*rPPPPPP 


C-AGGAACCTT 


GACAAAAGAA 


33240 


CCCACACTGA 


TTATGACACG 


CATACTCGGA 


rSPTliTGPTa a 
wdtf X AloL. X/lii 


L (J AGCGTAGC 


CCCGATGTAA 


33300 


GCTTTGTTGC 


ATGGGCGGCG 


ATATAAAATR 


P 2X ntzfymc^vr' 

V^/inviwr i V3V» X u 


uTCAAAAAAT 


CAGGCAAAGC 


33360 


CTCGCGCAAA 


AAAGAAAGCA 


CATCGTAGTC 


nx w\«. X X 


A\jA TAAAGGC 


AGGTAAGCTC 


33420 


CGGAACCACC 


ACAGAAAAAG 


ACACCATTTT 


TPTPTP Zl & 21 P 
X ^ X I. X wiwviW 


A X GTCTGCGG 


GTTTCTGCAT 


33480 


AAACACAAAA 


TAAAATAACA 


AAAAAACATT 




AAvjCCTGTCT 


TACAACAGGA 


33540 


AAAACAACCC 


TTATAAGCAT 


AAGACG6ACT 


A PGrSPP 21 fP P 


UGG CGTGACC 


GTAAAAAAAC 


33600 


TGGTCACCGT 


GATTAAAAAG 


CACCACCGAC 


AGCTCCTCGG 


1 LATGTCCGG 


AGTCATAATG 


33660 


TAAGACTCGG 


TAAACACATC 


AGGTTGATTC 


ATCGGTCAGT 


^^T'TV A TV TV Tl^/^ 

VJL.TAAAAAGC 


GACCGAAATA 


33720 


GCCCGGGGGA 


ATACATACCC 


GCAGGCGTAG 


AGACAACATT 


ACAGCCCCCA 


TAGGAGGTAT 


33780 


AACAAAATTA 


ATAGGAGAGA 


AAAACACATA AACACCTGAA 


AAACCCTCCT 


GCCTAGGCAA 


33840 


AATAGCACCC 


TCCCGCTCCA 


GAACAACATA 


CAGCGCTTCA 


CAGCGGCAGC 


CTAACAGTCA 


33900 


GCCTTACCAG 


TAAAAAAGAA 


AACCTATTAA AAAAACACCA 


CTCGACACGG 


CACCAGCTCA 


33960 


ATCAGTCACA 


GTGTAAAAAA 


GGGCCAAGTG 


CAGAGCGAGT 


TV TV FTTTV ^^^^ ^ 

ATATATAGGA 


CTAAAAAATG 


34020 


ACGTAACGGT 


TAAAGTCCAC 


AAAAAACACC 


CAGAAAACCG 


CACGCGAACC 


TACGCCCAGA 


34080 




•AAAAAACCCA 


CAACTTCCTC 


AAATCGTCAC 


TTCCGTTTTC 


CCACGTTACG 


34140 


TAACTTCCCA 


TTTTAAGAAA 


ACTACAATTC 


CCAACACATA 


CAAGTTACTC 


CGCCCTAAAA 


34200 


CCTACGTCAC 


CCGCCCCGTT 


CCCACGCCCC 


GCGCCACGTC 


ACAAACTCCA 


CCCCCTCATT 


34260 


ATCATATTGG 


CTTCAATCCA 


AAATAAGGTA 


TATTATTGAT 


GAT 




34303 



(2) INFORMATION FOR SEQ ID NO: 5: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 380 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: other nucleic acid 
(A) DESCRIPTION: /desc = "DNA" 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:S: 
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CCAGGCT7TA 


CACTTTATGC 


TTCCGGCTCG 


TATGTTGTGT 


GGAATTGTGA 


GCGGATAACA 


60 


ATTTCACACA 


GGAAACAGCT 


ATGACCATGA 


TTACGCCAAG 


CGCGCAATTA 


ACCCTCACTA 


120 


AAGGGAACAA 


AAGCTGGGTA 


CCGGGCCCCC 


CCTCGAGGTC 


GACGGTATCG 


ATAAGCTTAC 


180 


GCGTGGCCTA 


GGCGGCCGAA 


TTCCTGCAGC 


CCGGGGGATC 


CACTAGTTCT 


AGAGCGGCCG 


240 


CCACCGCGGC 


GCCTTAATTA 


ATACGTAAGC 


TCCAATTCGC 


CCTATAGTGA 


GTCGTATTAC 


300 


GCGCGCTCAC 


TGGCCGTCGT 


TTTACAACGT 


CGTGACTGGG 


AAAACCCTGG 


CGTTACCCAA 


360 


CTTAATCGCC 


TTGCAGCACA 










380 



(2) INFORMATION FOR SEQ ID N0:6: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 21 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: other nucleic acid 
(A) DESCRIPTION: /desc = "DNA" 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 6: 

TGCCGCAGCA CCGGATGCAT C 21 

(2) INFORMATION FOR SEQ ID NO: 7: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 27 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: other nucleic acid 
(A) DESCRIPTION: /desc = "DNA" 

(xi) SEQUENCE DESCRIPTION: SEQ ID N0:7: 

GCGTCCGGAG GCTGCCATGC GGCAGG6 27 

(2) INFORMATION FOR SEQ ID NO: 8: 

■ (i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1481 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: Other nucleic acid 

(A) DESCRIPTION: /desc = "DNA" 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 8: 



GGCTGCCATG 


CGGCAGGGAT 


ACGGCGCTAA 


CGATGCATCT 


CAACAATTGT 


TGTGTAGGTA 


60 


CTCCGCCGCC 


GAGGGACCTG 


AGCGAGTCCG 


CATCGACCGG 


ATCGGAAAAC 


CTCTCGAGAA 


120 


AGGCGTCTAA 


CCAGTCACAG 


TCGCAAGGTA 


GGCTGAGCAC 


CGTGGCGGGC 


GGCAGCGGGC 


180 


GGCGGTCGGG 


GTTGTTTCTG 


GCGGAGGTGC 


TGCTGATGAT 


GTAATTAAAG 


TAGGCGGTCT 


240 


TGAGACGGCG 


GATGGTCGAC 


AGAAGCACCA 


TGTCCTTGGG 


TCCGGCCTGC 


TGAATGCGCA 


300 


GGCGGTCGGC 


CATGCCCCAG 


GCTTCGTTTT 


GACATCGGCG 


CAGGTCTTTG 


TAGTAGTCTT 


360 


GCATGAGCCT 


TTCTACCGGC 


ACTTCTTCTT 


CTCCTTCCTC 


TT6TCCTGCA 


TCTCTTGCAT 


420 
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CTATCGCTGC 


GGCGGCGGCG 


GAGTTTGGCC 


GTAGGTGGCG 


WWWXWX X^WX 


w L ATGCGTG 


480 


TGACCCCGAA 


GCCCCTCATC 


GGCTGAAGCA 


GGGCTAGGTC 




v-vjc 1 CGGCTA 


540 


ATATGGCCTG 


CTGCACCTGC 


GTGAGGGTAG 


ACTGGAAGTC! 


ATPPATf^TPP 
/^l WWaI Vj 1 ^ W 


R ^ A TV 71 O/^/^/ '111! 

Ai.AAAGCGGT 


600 


GGTATGCGCC 


CGTGTTGATG 


GTGTAAGTGC 


AGTTGGCCAT 


AAPnnzippan 


1 1 AAL.GGTCT 


660 


GGTGACCCGG 


CTGCGAGAGC 


TCGGTGTACC 


TG AG Arn rn A 


V3 X MiioWUL. X 


vjAGTCAAATA 


720 


CGTAGTCGTT 


GCAAGTCCGC 


ACCAGGTACT 






GGCGGCGGCT 


780 


GGCGGTAGAG 


GGGCCAGCGT 






(auL.uAVjAxL 1 


TCCAACATAA 


840 


GGCGATGATA 


TCCGTAGATG 


TACCTGGAPA 


X \» Wn\jv X u#i 1 




GTGGTGGAGG 


900 


CGCGCGGAAA 


GTCGCGGACG 


CGGTTCCAGA 


1 vJ 1 1 VaWutV-nu 




TGCTCCATGG 


960 


TCGGGACGCT 


CTGGCCGGTC 


n w W W W \i» V9 Vw w V» 


ivil UVj X X viiiw 


1 t-TACCGT 


GCAAAAGGAG 


1020 


AGCCTGTAAG 


CGGGCACTCT 


TCCdTCZClTCr 

X Www X X W X 


Vao X VJVJn XiiA/i 


I TCQCAAGG6 


TATCATGGCG 


1080 


GACGACCGGG 


GTTCGAGCCC 


CnTATCCClCZC 

Wu X V«\vWwW 




UAx UuATGCG 


GTTACCGCCC 


1140 


GCGTGTCGAA 


CCCAGGTGTG 






tj 1 VjCTCCTT T 


TGGCTTCCTT 


1200 


CCAGGCGCGG 


CGGCTGCTGC 




X iUUIV.CnwXU 


^jl. l cgcg c a 


GCGTAAGCGG 


1260 


rTAGGCTGGA 


AAGCGAAAGC 


ATTAAGTGGC 


TCGCTCCCTG 


TAGCCGGAGG 


GTTATTTTCC 


1320 


AAGGGTTGAG 


TCGCGGGACC 


CCCGGTTCGA 


GTCTCGGACC 


GGCCGGACTG 


CGGCGAACGG 


1380 


GGGTTTGCCT 


CCCCGTCATG 


CAAGACCCCG 


CTTGCAAATT 


CCTCCGGAAA 


CAGGGACGAG 


1440 


CCCCTTTTTT 


GCTTTTCCCA 


GATGCATCCG 


GTGCTGCGGC 


A 




1481 



(2) INFORMATION FOR SEQ ID NO: 9: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 3364 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: other nucleic acid 
(A) DESCRIPTION: /desc « "DNA" 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 9: 



GAATTCCCCA 


TCCTGGTCTA 


TAGA6AGAGT 


TCCAGAACAG 


CCAGGGCTAC 


AGATAAACCC 


60 


ATCTGGAAAA 


ACAAAGTTGA 


ATGACCCAAG 


AGGGGTTCTC 


AGAGGGTGGC 


GTGTGCTCCC 


120 


TGGCAAGCCT 


ATGACATGGC 


CGGGGCCTGC 


CTCTCTCTGC 


CTCTGACCCT 


CAGTGGCTCC 


160 


CATGAACTCC 


TTGCCCAATG 


GCATCTTTTT 


CCTGCGCTCC 


TTGGGTTATT 


CCAGTCTCCC 


240 


CTCAGCATTC 


CTTCCTCAGG 


GCCTCGCTCT 


TCTCTCTGCT 


CCCTCCTTGC 


ACAGCTGGCT 


300 


CTGTCCACCT 


CAGATGTCAC 


AGTGCTCTCT 


CAGAGGAGGA 


AGGCACCATG 


TACCCTCTGT 


360 


TTCCCAGGTA 


AGGGTTCAAT 


TTTTAAAAAT 


GGTTTTTTGT 


TTGTTTGTTT 


GTTTGTTTGT 


420 


TTGTTTGTTT 


TTCAAGACAG 


GGCTCCTCTG 


TGTAGTCCTA 


ACTGTCTTGA 


AACTCCCTCT 


480 


GTAGACCAGG 


TCGACCTCGA 


ACTCTTGAAA 


CCTGCCACGG 


ACCACCCAGT 


CAGGTATGGA 


540 


GGTCCCTGGA 


ATGAGCGTCC 


TCGAAGCTAG 


GTGGGTAAGG 


GTTCGGCGGT 


GACAAACAGA 


600 


AACAAACACA 


GAGGCAGTTT 


GAATCTGAGT 


GTATTTTGCA 


GCTCTCAAGC 


AGGGGATTTT 


660 
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ATACATAAAA 


AAAAAAAAAA 


AAAAAAAACC 


AAACATTACA 


TCTCTTAGAA 


ACTATATCCA 


720 


ATGAAACAAT 


CACAGATACC 


AACCAAAACC 


ATTGGGCAGA 


GTAAAGCACA 


AAAATCATCC 


• 780 


AAGCATTACA 


ACTCTGAAAC 


CATGTATTCA 


GTGAATCACA 


AACAGAACAG 


GTAACATCAT 


840 


TATTAATATA 


AATCACCAAA 


ATATAACAAT 


TCTAAAAGGA 


TGTATCCAGT 


GGGGGCTGTC 


900 


GTCCAAGGCT 


AGTGGCAGAT 


TTCCAGGAGC 


AGGTTAGTAA 


ATCTTAACCA 


CTGAACTAAC 


960 


TCTCCAGCCC 


CATGGTCAAT 


TATTATTTAG 


CATCTAGTGC 


CTAATTTTTT 


TTTATAAATC 


1020 


TTCACTATGT 


AATTTAAAAC 


TATTTTAATT 


CTTCCTAATT 


AAGGCTTTCT 


TTACCATATA 


1080 


CCAAAATTCA 


CCTCCAATGA 


CACACGCGTA 


GCCATATGAA 


ATTTTATTGT 


TGGGAAAATT 


1140 


TGTACCTATC ATAATAGTTT 

« 


TGTAAATGAT 


TTAAAAAGCA 


AAGTGTTAGC 


CGGGCGTGGT 


1200 


GGCACACGCC 


TTT AATC C CT 


GCACTCGGGA 


GGCAGGGGCA 


GGAGGATTTC 


TGAGTTTGAG 


1260 


GCCAGCCTGG 


TCTACAGAGT 


GAGTTCCAGG 


ACAGCCAGGG 


CTACACAGAG 


AAACCCTGTC 


1320 


TCGAACCCCC 


CACCCCCCAA 


AAAAAGCAAA 


GTGTTGGTTT 


CCTTGGGGAT 


AAAGTCATGT 


1380 




X ^ X V» X n\3\3\^\^ 


CATCTCACCC 


ATTATTCTCG 


CTTAAGATCT 


TGGCCTAGGC 


1440 


TArCAGGAAC 


nx \3 xnnnxxix% 


GAAAAGGAAT 


AAGAGAAAAC 


AAAACAGAGA 


GATTGCCATG 


1500 






TTTCTCTCCG 


GCGAAGAGTT 


CCACAACCAT 


CTCCAGGAGG 


1560 


CCTCCACGTT 


TTGAC3GTPAA 


TGGCCTCAGT 


CTGTGGAACT 


TGTCACACAG 


ATCTTACTGG 


1620 


AGGTGGTGTG 




ATTCCTTTTA 


GTGTCTTGGG 


CTAAAAGTAA 


AAGGCCCAGA 


1680 


GGAGGCCTTT 


GCTCATCTGA 


CCATGCTGAC 


AAGGAACACG 


GGTGCCAGGA 


CAGAGGCTGG 


1740 


ACCCCAGGAA 


CACCTTAAAC 


ACTTCTTCCC 


TTCTCCGCCC 


CCTAGAGCAG 


GCTCCCCTCA 


1800 


CCAGCCTGGG 


CAGAAATGGG 


GGAAGATGGA 


GTGAAGCCAT 


ACTGGCTACT 


CCAGAATCAA 


1860 


CAGAGGGAGC 


CGGGGGC7VAT 


ACTGGAGAAG 


CTGGTCTCCC 


CCCAGGGGCA 


ATCCTGGCAC 


1920 


CTCCCAGGCA 


GAAGAGGAAA 


CTTCCACAGT 


GCATCTCACT 


TCCATGAATC 


CCCTCCTCGG 


1980 


ACTCTGAGGT 


CCTTGGTCAC 


AGCTGAGGTG 


CAAAAGGCTC 


CTGTCATATT 


GTGTCCTGCT 


2040 


CTGGTCTGCC 


TTCCACAGCT 


TGGGGGCCAC 


CTAGCCCACC 


TCTCCCTAGG 


GATGAGAGCA 


2100 


GCCACTACGG 


GTCTAGGCTG 


CCCATGTAAG 


GAGGCAAGGC 


CTGGGGACAC 


CCGAGATGCC 


2160 


TGGTTATAAT 


TAACCCAGAC 


ATGTGGCTGC 


CCCCCCCCCC 


CCAACACCTG 


CTGCCTGAGC 


2220 


CTCACCCCCA 


CCCCGGTGCC 


TGGGTCTTAG 


GCTCTGTACA 


CCATGGAGGA 


GAAGCTCGCT 


2280 


CTAAAAATAA 


CCCTGTCCCT 


GGTGGATCCA 


GGGTGAGGGG 


CAGGCTGAGG 


GCGGCCACTT 


2340 


CCCTCAGCCG 


CAGGTTTGTT 


TTCCCAAGAA 


TGGTTTTTCT 


GCTTCTGTAG 


CTTTTCCTGT 


2400 


CAATTCTGCC 


ATGGTGGAGC 


AGCCTGCACT 


GGGCTTCTGG 


GAGAAACCAA 


ACCGGGTTCT 


2460 


AACCTTTCAG 


CTACAGTTAT 


TGCCTTTCCT 


GTAGATGGGC 


GACTACAGCC 


CCACCCCCAC 


2520 


CCCCGTCTCC 


TGTATCCTTC 


CTGGGCCTGG 


GGATCCTAGG 


CTTTCACTGG 


AAATTTCCCC 


2580 


CCAGGTGCTG 


TAGGCTAGAG 


TCACGGCTCC 


CAAGAACAGT 


GCTTGCCTGG 


CATGCATGGT 


2640 


TCTGAACCTC 


CAACTGCAAA AAATGACACA 


TACCTTGACC 


CTTGGAAGGC 


TGAGGCAGGG 


2700 


GGATTGCCAT 


GAGTGCAAAG 


CCAGACTGGG 


TGGCATAGTT 


AGACCCTGTC 


TCAAAAAACC 


2760 
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GAGGCAGAGG 2820 


fiATTflTTAr'A 


i 1 CI bAGGC CAGCCTGGAC 


TACATAGGGT 


TTCAGGCTAG 


CCCTGTCTAC 


2880 




v-v-iATTTCAA AAACACAAAC 


AAAATGGTTC 


TCCCAGCTGC 


TAATGCTCAC 


2940 


CAGGCAATt^A 


AGCCTGGTGA GCATTAGCAA 


TGAAGGCAAT 


GAAGGAGGGT 


GCTGGCTACA 


3000 

\m 


ATf AArtGr*Tf2 


TGGGGGACTG AGGGCAGGCT 


GTAACAGGCT 


TGGGGGCCAG 


GGCTTATACG 


3060 


TGCCTCiCiCi A P 


TCCCAAAGTA TTACTGTTCC ATGTTCCCGG 


CGAAGGGCCA 


GCTGTCCCCC 


312 0 




CTCAGCACTT AGTTTAGGAA 


CCAGTGAGCA 


AGTCAGCCCT 


TGGGGCAGCC 


3180 




CATGGGGCTG GGCAAGCTGC 


ACGCCTGGGT 


CCGGGGTGGG 


CACGGTGCCC 


3240 




CTGAAAGCTC ATCTGCTCTC 


AGGGGCCCCT 


CCCTGGGGAC 


AGCCCCTCCT 


3300 


GGCTAGTCAC 


ACCCTGTAGG CTCCTCTATA 


TAACCCAGGG 


GCACAGGGGC 


TGCCCCCGGG 


3360 


TCAC 










3364 


(2) INFORMATION FOR SEQ ID NO : 10 : 











(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 174 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: other nucleic acid 
(A) DESCRIPTION: /desc = "DNA" 

(Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 10: 
GGTACCACTA CGGGTCTAGG CTGCCCATGT AAGGAGGCAA GGCCTGGGGA CACCCGAGAT 60 
GCCTGGTTAT AATTAACCCC AACACCTGCT GCCCCCCCCC CCCCAACACC TGCTGCCTGA 120 
GCCTGAGCGG TTACCCCACC CCGGTGCCTG GGTCTTAGGC TCTGTACACC ATGG 174 
(2) INFORMATION FOR SEQ ID NO: 11: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 699 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: other nucleic acid 
(A) DESCRIPTION: /desc = "DNA" 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 11: 

AGATCTTCCT AGAAATTCGC TGTCTGCGAG GGCCGGCTGT TGGGGTGAGT ACTCCCTCTC 60 

AAAAGCGGGC ATGACTTCTG CGCTAAGATT GTCAGTTTCC AAAAACGAGG AGGATTTGAT 120 

ATTCACCTGG CCCGCGGTGA TGCCTTTGAG GGTGGCCGCG TCCATCTGGT CAGAAAAGAC 180 

AATCTTTTTG TTGTCAAGCT TGAGGTGTGG CAGGCTTGAG ATCGATCTGG CCATACACTT 240 

GAGTGACAAT GACATCCACT TTGCCTTTCT CTCCACAGGT GTCCACTCCC AGGTCCAACC 300 

GCGGATCTCC CGGGACCATG CCCAAGAAGA AGAGGAAGGT GTCCAATTTA CTGACCGTAC 360 

ACCAAAATTT GCCTGCATTA CCGGTCGATG CAACGAGTGA TGAGGTTCGC AAGAACCTGA 420 

TGGACATGTT CAGGGATCGC CAGGCGTTTT CTGAGCATAC CTGGAAAATG CTTCTGTCCG 480 
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TTTGCCGGTC GTGGGCGGCA TGGTGCAAGT TGAATAACCG GAAATGGTTT CCCGCAGAAC 



540 



CTGAAGATGT TCGCGATTAT CTTCTATATC TTCAGGCGCG CGGTCTGGCA GTAAAAACTA 



600 



TCCAGCAACA TTTGGGCCAG CTAAACATGC TTCATCGTCG GTCCGGGCTG CCACGACCAA 



660 



GTGACAGCAA TGCTGTTTCA CTGGTTATGC GGCGGATCC 



699 



(2) INFORMATION FOR SEQ ID NO: 12: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 68 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: Other nucleic acid 
(A) DESCRIPTION: /desc = "DNA" 

(Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 12: 

GAAGATCTAT AACTTCGTAT AATGTATGCT ATACGAAGTT ATTACCGAAG AAATGGCTCG 60 

AGATCTTC 68 

(2) INFORMATION FOR SEQ ID NO: 13: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 68 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: Other nucleic acid 
(A) DESCRIPTION: /deSC = "DNA" 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 13: 

GAAGATCTCG AGCCATTTCT TCGGTAATAA CTTCGTATAG CATACATTAT ACGAAGTTAT 60 

AGATCTTC 68 

(2) INFORMATION FOR SEQ ID NO: 14: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 50 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: other nucleic acid 

(A) DESCRIPTION: /desc = "DNA" 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 14: 
CCACATGTAT AACTTCGTAT AGCATACATT ATACGAAGTT ATACATGTGG 50 
(2) INFORMATION FOR SEQ ID NO: 15: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: SO base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: other nucleic acid 

(A) DESCRIPTION: /desc = "DNA" 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 15: 
CCACATGTAT AACTTCGTAT AATGTATGCT ATACGAAGTT ATACATGTGG 50 
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CLAIMS 

1 . A recombinant plasmid, comprising in operable combination: 

a) a plasmid backbone, comprising an origin of replication, an 

antibiotic resistance gene and a eukaryotic promoter element; 
5 b) the left and right inverted terminal repeats (ITRs) of 

adenovirus, said ITRs each having a 5' and a 3' end and arranged in a tail 

to tail orientation on said plasmid backbone; 

c) the adenovirus packaging sequence, said packaging sequence 

having a 5' and a 3' end and linked to one of said ITRs; and 

'° d) a first gene of interest operably linked to said promoter 

element. 



2. The recombinant plasmid of Claim 1 wherein the total size of said 
plasmid is between 27 and 40 kilobase pairs. 

3. The recombinant plasmid of Claim 1, wherein said 5' end of said 
15 packaging sequence is linked to said 3' end of said left ITR. 

4. The recombinant plasmid of Claim 3, wherein said first gene of 
interest is linked to said 3' end of said packaging sequence. 

5. The recombinant plasmid of Claim 4, wherein said first gene of 
interest is the dystrophin cDNA gene. 

6. The recombinant plasmid of Claim 4, further comprising a second 
gene of interest. 

7. The recombinant plasmid of Claim 6, wherein said second gene of 
interest is linked to said 3' end of said right ITR. 



25 



8. The recombinant plasmid of Claim 7, wherein said second gene of 
interest is a reporter gene. 
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9. The recombinant plasmid of Claim 8. wherein said reporter gene is 
selected from the group consisting of the E. coli P-galactosidase gene, the human 
placental alkaline phosphatase gene, the green fluroescent protein gene and the 
choramphenicol acetyltransferase gene, 

5 10. A mammalian cell line containing the recombinant plasmid of 

Claim 1. 



11. The cell line of Claim 10, wherein said cell line is a 293-derived cell 



line. 



12. A helper adenovirus comprising i) first and a second loxP sequences, 
10 and ii) the adenovirus packaging sequence, said packaging sequence having a 5' 
and a 3* end. 

13- The helper adenovirus of Claim 12, wherein said first loxP sequence 
is linked to the 5' end of said packaging sequence and said second loxP sequence is 
linked to said 3' end of said packaging sequence. 

15 14. The helper adenovirus of Claim 12, further comprising at least one 

adenovirus gene coding region. 

15. A mammalian cell line, comprising: 

a) a recombinant plasmid. comprising, in operable combination: 

i) a plasmid backbone, comprising an origin of 

20 replication, an antibiotic resistance gene and a eukaryotic promoter 

element, 

ii) the left and right inverted terminal repeats (ITRs) of 
adenovirus, said ITRs each having a 5' and a 3' end and arranged in 
a tail to tail orientation on said plasmid backbone, 

25 iii) the adenovirus packaging sequence, said packaging 

sequence having a 5' and a 3* end and linked to one of said ITRs, 
and 
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iv) a first gene of interest operably linked to said 
promoter element: and 

b) a helper adenovirus comprising i) first and a second loxP 
sequences, and ii) the adenovirus packaging sequence, said packaging 
5 sequence having a 5" and a 3' end. 

16. The cell line of Claim 15, wherein said helper adenovirus further 
comprises at least one adenovirus gene coding region. 

17. The cell line of Claim 15, wherein said first gene of interest is the 
dystrophin cDNA gene 

18. A method of producing an adenovirus minichromosome. comprising: 

A) providing a mammalian cell line containing: 

a) a recombinant plasmid, comprising, in operable 
combination, i) a plasmid backbone, comprising an origin of 
replication, an antibiotic resistance gene and a eukarotic promoter 
element, ii) the left and right inverted terminal repeats (ITRs) of 
adenovirus, said ITRs each having a 5' and a 3" end and arranged in 
a tail to tail orientation on said plasmid backbone, iii) the adenovirus 
packaging sequence, said packaging sequence having a 5" and a 3' 
end and linked to one of said ITRs, and iv) a first gene of interest 

20 operably linked to said promoter element; and 

b) a helper adenovirus comprising i) first and a second 
loxP sequences, ii) at least one adenovirus gene coding region, and 
iii) the adenovirus packaging sequence, said packaging sequence 
having a 5' and a 3' end; and 

B) growing said cell line under conditions such that said 
adenovirus gene coding region is expressed and said recombinant plasmid 
directs the production of at least one adenoviral minichromosome. 

19. The method of Claim 18, wherein said adenovirus minichromosome 
is encapidated. 
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20. The method of Claim 18. further comprising 3) recovering said 
encapsidated adenovirus minichromosome. 

21. The method of Claim 20. further comprising 4) purifiying said 
recovered encapsidated adenovirus minichromosome. 

5 22. The method of Claim 21. further comprisng 5) administering said 

purified encapsidated adenovirus minichromosome to a host. 

23. The method of Claim 22, wherein said host is a mammal. 

24. The method of Claim 23, wherein said mammal is a human. 

25. A recombinant adenovirus comprising the adenovirus E2b region 
10 having a deletion, said adenovirus capable of self-propagation in a packaging cell 

line and said E2b region comprising the DNA polymerase gene and the adenovirus 
preterminal protein gene. 

26. The recombinant adenovirus of Claim 25. wherein said deletion is 
within the adenovirus DNA polymerase gene. 

1 5 27, The recombinant adenovirus of Claim 25, wherein said deletion is 

within the adenovirus preterminal protein gene. 

28. The recombinant adenovirus of Claim 25, wherein said deletion is 
within the adenovirus DNA polymerase and preterminal protein genes. 

29. A mammalian cell line stably and constitutively expressing the 
20 adenovirus El gene products and the adenovirus DNA polymerase. 

30. The cell line of Claim 29. wherein said cell line comprises a 
recombinant adenovirus comprising a deletion within the E2b region, said 
recombinant adenovirus being capable of self-propagation in said cell line. 
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31. The ceil line of Claim 30. wherein said deletion within said E2b 
region comprises a deletion within the adenoviral DNA polymerase gene. " 

32. The cell of Claim 29. wherein the genome of said cell line contains 
a nucleotide sequence encoding adenovirus DNA polymerase operably linked to a 

5 heterologous promoter. 



The cell line of Claim 32, wherein said cell line is selected from the 
group consisting of the B-6, B-9. C-1, C-4. C-7, C-13, and C-14 cell lines. 

34. The cell line of Claim 29 further constitutively expressing the 
adenovirus preterminal protein gene product. 

35. The cell line of Claim 34, wherein said cell line comprises a 
recombinant adenovirus comprising a deletion within the E2b region, said 
recombinant adenovirus being capable of self-propagation in said cell line. 

36. The cell line of Claim 35. wherein said deletion within said E2b 
region comprises a deletion within the adenoviral preterminal protein gene. 

' ^ The cell line of Claim 36. wherein said deletion within said E2b 

region comprises a deletion within the adenoviral DNA polymerase and preterminal 
protein genes. 

38. The cell line of Claim 34. wherein the genome of said cell line 
contains a nucleotide sequence encoding adenovirus preterminal protein operably 

20 linked to a heterologous promoter. 

39. The cell line of Claim 38. wherein said cell line is selected from the 
group consisting of the C-l. C-4. C-7. C-13, and C-14 cell lines. 
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40. A method of producing infectious recombinant adenovirus particles 

containing an adenoviral genome containing a deletion within the E2b region, 
comprising 

a) providing: 

5 i) a mammalian cell line stably and constitutively 

expressing the adenovirus El gene products and the adenovirus DNA 
polymerase: 

ii) a recombinant adenovirus comprising a deletion within 
the £2b region, said recombinant adenovirus being capable of self- 
10 propagation in said cell line; 

b) introducing said recombinant adenovirus into said cell line 
under conditions such that said recombinant adenovirus is propagated to 
form infectious recombinant adenovirus particles; and 

c) recovering said infectious recombinant adenovirus particles. 

15 41. The method of Claim 40, further comprising d) purifying said 

recovered infectious recombinant adenovirus particles. 

42. The method of Claim 41, further comprising e) administering said 
purified recombinant adenovirus particles to a host. 

43. The method of Claim 40, wherein said mammalian cell line further 
20 constitutively expresses the adenovirus preterminal protein. 

44. A recombinant plasmid capable of replicating in a bacterial host 
comprising adenoviral E2b sequences, said £2b sequences containing a deletion 
within the polymerase gene, said deletion resulting in reduced polymerase activity. 

45. The recombinant plasmid of Claim 44, wherein said deletion 
25 comprises a deletion of nucleotides 8772 to 9385 in SEQ ID N0:4. 

46. The recombinant plasmid of Claim 45, wherein said plasmid has the 
designation pApol. 
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47. The recombinant plasmid of Claim 45. wherein said plasmid has the 
designation pBHG I Mpol. 



48. A recombinant plasmid capable of replicating in a bacterial host 
comprising adenoviral E2b sequences, said E2b sequences containing a deletion 
within the preterminal protein gene, said deletion resulting in the inability to 
e-xpress functional preterminal protein without disruption of the VA RNA genes. 



5 



49. The recombinant plasmid of Claim 48, wherein said deletion within 
said preterminal protein gene comprises a deletion of nucleotides 10,705 to 1 1,134 
in SEQ ID N0:4. 

50. The recombinant plasmid of Claim 49. wherein said plasmid has the 
designation pApTP. 

51. The recombinant plasmid of Claim 49, wherein said plasmid has the 
designation pBHGUApTP. 

52. The recombinant plasmid of Claim 48 further comprising a deletion 
15 within the polymerase gene, said deletion resulting in reduced polymerase activity. 

53. The recombinant plasmid of Claim 52, wherein said deletion within 
said polymerase and said preterminal protein genes comprises a deletion of 
nucleotides 8,773 to 9586 and 11,067 to 12,513 in SEQ ID NO:4. 

54. The recombinant plasmid of Claim 53, wherein said plasmid has the 
20 designation pAXBApolApTPVARNA+tl3. 

55. The recombinant plasmid of Claim 53, wherein said plasmid has the 
designation pBHGnApolApTPVARNA+tl3. 
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